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Abstract 

Let Yi,i > 1, be i.i.d. random variables having values in an m-dimensional 
manifold M C M d and consider sums Yl?=i £,{ nl ^ d Yi, {n l I d Yi}™ =1 ) , where £ is a 
real valued function defined on pairs (y,y), with y G R d and y C R d locally 
finite. Subject to £ satisfying a weak spatial dependence and continuity condition, 
we show that such sums satisfy weak laws of large numbers, variance asymptotics, 
and central limit theorems. We show that the limit behavior is controlled by the 
value of £ on homogeneous Poisson point processes on m-dimensional hyperplanes 
tangent to Ai. We apply the general results to establish the limit theory of 
dimension and volume content estimators, Renyi and Shannon entropy estimators, 
and clique counts in the Vietoris-Rips complex on {Yi}f =1 . 
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1 Introduction 

There has been recent interest in the statistical, topological, and geometric properties 
of high-dimensional non-linear data sets. Typically the data sets may be modeled as 
realizations of i.i.d. random variables {Yi}" =1 having support on an unknown non- 
linear manifold M. embedded in M. d . Given a sample {1^}™ =1 , whose pairwise distances 
are given, but whose coordinate representation is not, can one determine geometric 
characteristics of the manifold, including its intrinsic dimension and volume content? 
Can one recover global properties of the distribution of {Yi}™ =1 such as its intrinsic 
entropy? These properties, as well as graph theoretic functionals such as clique counts 
in the Vietoris-Rips complex may be studied via the statistics of the form 

E^,{^}-=i), (i.i) 

i=l 
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where £(•, •) is a real-valued measurable function defined on pairs (y,y), where y G y 
and y C M d is locally finite, with £(y,y) locally determined in some sense. 

Our goal is to establish the dependency between the large n behavior of the statistics 
(II. ID . the underlying point density k of the {Yj}™ =1 , and the manifold Ai. For Ai = M. d 
there is a large literature describing limit theorems for (II. ip ((5J |35l |36j [37J (38J EES] ) 
whereas when Ai 7^ M. d there is a relative dearth of results, although spatial data 
generated by a curved surface embedded in three dimensional space are arguably more 
natural than those involving data generated by flat surfaces. This paper partly redresses 
this situation under reasonably general conditions on k and Ai. 

In Section [2] we present laws of large numbers and central limit theorems for (i) the 
Levina-Bickel dimension estimator for data {Yi}™ =1 supported on a manifold, (ii) Renyi 
and Shannon entropy estimators for {Yj}^, (iii) volume estimators for the support 
of {Yj}™ =1 and (iv) the order k clique count in the Vietoris-Rips complex on {Vj}™ =1 . 
The asymptotic normality results for dimension and entropy estimators appear to be 
new even in the setting of linear manifolds. The mean and variance asymptotics for 
the Levina-Bickel dimension estimator depend only on the dimension of Ai and are 
invariant with respect to k, whereas for most of the other functionals considered here, 
the mean and variance asymptotics explicitly depend on k via the integral f M (n{y)) p dy 
for some p G R. 

We shall derive these results from general theorems governing the limit theory of 
YJl =l t(n 1/m Yi, {n^Yj}]^), where m is the intrinsic dimension of Ai, n 1//m is a dilation 
factor, and £ belongs to a general class of translation invariant functionals that are 
determined by the locations of either the k nearest neighbors of y for some fixed fceN, 
or the points of y within some fixed distance of y. Our general results are presented in 
Section [3j 

The limit theory makes use of the fact that for such £, the large n behavior with 
respect to the shifted and dilated point sets {n l l m (Yj — Yi)}™ =1 is controlled by the be- 
havior of £ on homogeneous Poisson point processes on tangent hyperplanes of Euclidean 
dimension m, that is to say the Grassmannian manifold Gr m (d) of m-dimensional linear 
subspaces of M d . The locally defined behavior of £, quantified in terms of dependency 
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graphs involving radii of stabilization of £, yields central limit theorems via Stein's 
method. 

In fact, our methods often work in still greater generality. Most of the examples 
considered in [5j [351 EHJ E7J |38j HE] have £ stabilizing, i.e. locally determined in some 
sense, and it should be possible to adapt our methods to most of these examples. For 
example, we anticipate that our methods can be extended to establish the limit theory 
for the total edge length and other stabilizing functionals of the Delaunay and Voronoi 
graphs on random point sets in manifolds. We also expect that our methods extend to 
give the limit theory of statistics of germ-grain models, coverage processes and random 
sequential adsorption models generated by data on manifolds. Moreover, we anticipate 
that the theory presented here can be modified to establish limit theorems for generalized 
spacing statistics based on k nearest neighbor distances for random points in a manifold, 
including estimators of relative entropy such as those considered in [3], although these 
involve consideration of non-translation invariant £ so they do not automatically fall 
within the scope of this paper. 

2 Stochastic functionals on manifolds 

2.1 Terminology and definitions 

For k G N, let || ■ || be the Euclidean norm in M. h . Let denote a point at the origin of 
R k . For r G (0, oo) and z G K fc , let B r (z) := {y G M fe : \\y - z\\ < r}. Given F C R k , 
and y G R k , a > 0, set y + F := {y + z : z G F} and aF := {az : z G F}. If F is locally 
finite, let card(F) denote the cardinality (number of elements) of F. If also y G M. k and 
j G Z + := {0, 1,2,.. .}, then let Nj(y, F) be the Euclidean distance between y and its 
jth nearest neighbor in F \ {y}, i.e. 

Nj(y, F) := inf {r > : card(F R B r (y) \ {y}) > j} (2.1) 

with the infimum of the empty set taken to be +oo. In particular, No(y,F) = 0. Let 
$ be the distribution function for the standard normal random variable A/"(0, 1) and 
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let — > denote convergence in probability. For a > 0, let Af(0, cr 2 ) denote the random 
variable aAf(0, 1). 

Let m G N and d G N with m < d. A nonempty subset .M of M. d , endowed with the 
subset topology, is called an m-dimensional C 1 submanifold of R d if for each y G Ai 
there exists an open subset U of R m and a continuously differentiate injection g from 
U to R d , such that (i) y G <?(t/) C Ai, and (ii) g is an open map from U to Ai, and (iii) 
the linear map g'(u) has full rank for all u G U (see e.g. Theorem 2.1.2(v) of [7]). The 
pair ([/, g) is called a chart. Let M := M(m, rf) denote the class of all m-dimensional C l 
submanifolds of R d which are also closed subsets of M. d . 

Given Ai G M, using a routine compactness argument we can choose an index set 
IcN, and a set {{yji, Si, U{, gi),i G X} of ordered quadruples with yi G Ai, Si G (0, oo), 
and (Ui,gi) a chart for each i, such that (i) M. n B^iyi) C g yi {Ui) for each z, and (ii) 

X C U ie i%(z/i)- 

We refer to ((Ui,gi),i G X) as an ai/as for .M. Given such an atlas, we can find 
a partition of unity {ipi} subordinate to the atlas, that is, a collection of functions 
G X) from M. to [0, 1], such that ^2 ie r^i{y) = 1 for all y G Ai, and such that for 
each i, ipi(y) = for y ^ gi{Ui), and ^ o ^ is a measurable function on C/j. The more 
common definition of a partition of unity has some extra differentiability conditions on 
ipi but these are not needed here. With our more relaxed definition, the existence of a 
partition of unity is completely elementary to prove. 

Given % G X and x G Z7», let D g .(x) := a/ det (J gi (x)) T (J gi (x)), with J ft standing for 
the Jacobian of ^j. For bounded measurable /i : Ai — > R, the integral h(y)dy is 
defined by 

/ Hy)dy = y2 Kg i {x))ip i (g i (x))DgXx)dx (2.2) 

which is well-defined in the sense that it does not depend on the choice of atlas or the 
partition of unity. Equation ( 12. 2 p is discussed further at (15. lSj) below. 

Given any manifold Ai G M and /C C Ai, the relative interior of K, consists of 
all those y G K, such that y has a neighborhood in Ai that is contained in K,. The 
boundary of /C is the set of all other y G K (possibly empty). We say K C Ai is a C 1 
submanifold-with-boundary of .M if for all y in the boundary of /C, there exists a choice 
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of chart (U, g) for M such that E U, and g(0) = y, and g([0, oo) x IR m_1 ) = g(U) D /C. 
This includes the possibility that /C has empty boundary. 

Given .M G M, a probability density function on .M is a non-negative scalar field 
k on M satisfying j M K,(y)dy = 1. Let F(Ai) denote the class of probability density 
functions on Ai. Given k G F(Ai), let )C(k) denote the support of k, i.e. the smallest 
closed set K, C Ai such that L. n(y)dy = 1. Given also pel, define the integral 



Let Pb(A^) denote the class of bounded probability density functions n G P(A^), such 
that /C(k) is compact. Let P C (.M) denote those probability density functions n G F b (Ai) 
whose support K{k) is a compact C l sub-manifold-with-boundary of A4, and which are 
bounded away from zero and infinity on their support. 

Suppose MeM and k G F(Ai) are given. Let random variables Yi,Y 2 , . . . be i.i.d. 
with probability density function k with respect to the Riemannian volume element 
dy. Define the binomial point process y n := {Yj}^ =1 . Let V\ denote the Poisson point 
process on M. (and also the associated counting measure) having intensity density Ak(-), 
that is 



Recall that £(■, ■) denotes a measurable function defined on pairs (y,y), where y G 
y C M. d and y is locally finite. When y y, we write y) instead of £(y,y U {y})] 
also, we sometimes write y y for y U {y}. 

Let "H denote a homogeneous Poisson process of unit intensity in M m with W 71 em- 
bedded in M d (since m < d) so that the random variable £(0, "H) is well defined. In 
keeping with notation from [51 [35] , for all such functionals £ we set 



I p (k) := / := / (K(y)yi{)C(K)}(y)dy 

Jk(k) J M 



(2.3) 



EV x (dy) = \K(y)dy. 



(2.4) 




(2.5) 



and 




(2.6) 



whenever these integrals are defined. 
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2.2 Estimators of intrinsic dimension, manifold learning 

Given data embedded in a high dimensional vector space, a natural problem in manifold 
learning, signal processing, and statistics is to discover the low dimensional structure 
of the data, namely the intrinsic dimension of the hypersurface containing the data. 
Levina and Bickel [30J propose a dimension estimator making use of nearest neighbor 
statistics. Their estimator, which uses distances between a given sample point and its 
k nearest neighbors, estimates the dimension of random variables lying on a manifold 
Ai of unknown dimension m embedded in M. d , d > m. Specifically, for all k = 3,4, 
the Levina and Bickel estimator of the dimension of a finite data cloud y C Ai G M, is 
given by 

m k := rh k (y) := (card(^))- 1 ^ ( k (y, y), (2.7) 

yey 

where for all y e 3^ we have 

Uy,y)-.4 {k ~ 2) ^ Z>e ™ y1 ' l( ™ i[y)> - k + \ (2.8) 

I 0, otherwise 
where Nj(y) := Nj(y, y), as given by (12. ip . For all p > 0, we also define 

tkAv>y) ■= Ck{y,y)l{N k {y) < p}, (2.9) 
using the convention x oo = if necessary, and we put 

m Kp := m k>p {y) ■= (card(^))- 1 ^ C Kp {y, y). (2.10) 

yey 

Given y n := {1^}" =1 as in Section [2~T| Levina and Bickel [30] argue that rh k {y n ) 
estimates the intrinsic dimension of Ai. Our purpose here is to substantiate this claim 
and to provide further distributional results. The following shows (i) consistency of 
the dimension estimator m k over Poisson and binomial samples, and (ii) a central limit 
theorem for rh ktP (y n ), p fixed and small, addressing a question raised by Peter Bickel. 
Here 5^ is given by taking £ = ( k in (12 . 5 j) - ( T276]) . 
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Theorem 2.1 Let M £ M(m, d) and let k £ F C (M). For all k > 3 and p > 0, we have 

lim l{m k>p (y n ) + m k {y n )} = a.s. (2.11) 

n— »oo 

and if k > 4 i/ien 

rnk{y n ) — > Tn as n — )■ oo, (2.12) 

w/w/e if k > 11 then (\2.12l) holds a.s. If k is a.e. continuous then there exists p\ > 
such that if p £ (0, pi) and k > 7, then 

2 

777. 

lim nVar[m fc>p (^n)] = ^ 2 (G0 := i 5 - (5 Cfc ) 2 > 0, (2.13) 

and also we have as n — >■ oo, 

^ 1/2 (m fclP (yn) -M^(W) A ^(0,a 2 (a)). (2.14) 

Remarks, (i) {Counterexamples) Without further conditions on .M £ M, in general 
rhk might not satisfy the variance asymptotics (I2.13P or the central limit theorem ( 12.14)) . 
To see this, suppose m — 1 and d = 3, and suppose Ai is a compact 1-manifold which 
includes a segment S on the z-axis, say from z = to z = 1, as well as an arc of the unit 
circle in the (a;, y) plane. If Y t are i.i.d. with the uniform measure on Ai, then there is 
a positive probability that Y\ £ S and that its k nearest neighbors are all on the arc. In 
this case the Nj(Yi, y n ),j — 1, k — 1, all coincide and none of the moments of rhk{y n ) 
exist. 

For a more general counterexample along similar lines, fix m and put d — 2(m + 1). 
Let S be the set of unit vectors in M. d and let Si be those x £ S such that the first m + 1 
coordinates are zero and let S2 be those x £ S such that the last m + 1 coordinates 
are zero. Then Ai := Si U S2 is an m-dimensional manifold in M. If Yj are i.i.d. with 
the uniform volume measure on Ai, then there is a positive probability that Y\ £ S\ 
and that its k nearest neighbors all belong to S'2. In this case Nj(Yi,y n ) = 2 for all 
j = 1, k — 1, showing that Ck{Yi, iVn) is infinite with positive probability and therefore 
none of the moments of rhk(y n ) exist. 

(ii) (On the constant pi) The constant pi, loosely speaking, reflects the maximum 
amount of curvature of the manifold Ai. The larger this is, the smaller pi has to be 
taken. See Lemma [4.31 below. 
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(iii) (Relation to previous work) Levina and Bickel (Section 3.1 of [9]) argue on 
heuristic grounds that Var[mfc(3^ n )] = 0(n _1 ) whenever the Yi are the image under a 
sufficiently smooth map g of random variables having a smooth density, but the last 
counterexample shows that this bound is not true in general. 

Chatterjee [13] provides a rate of normal approximation for rhj : (y n ) for all k > 9 
whenever Ai is a 'nice' manifold and under minimal assumptions on the distribution of 
Yj. His rates are with respect to the Kantorovich-Wasserstein distance and are of order 
n (9-fc)/(2fc-2)^ SUD j ec t to the validity of Var[m fc (3 ; n )] = 6(n _1 ). 

Bickel and Yan (Theorems 1 and 3 of Section 4 of [10]) establish a central limit 
theorem for rhk(y n ) for Ai = M. d . The methods of Bickel and Breiman [9] or [5l [371 EH] 
could be used to establish the asymptotic normality of rhk(y n ) if the Yi had a density 
with respect to Lebesgue measure on M. d . These methods do not appear applicable in the 
present situation. Under strong assumptions on Ai, Yukich [19] outlines an approach 
giving a rate of normal approximation for rhk(V\), but does not provide consistency 
results or variance asymptotics for either rhk(V\) or rhk(y n ). 

(iv) (Limits are dimension dependent only) The mean and variance asymptotics 
f )2.12p and ( 12 . 1 3[) are invariant with respect to k and depend only on dim(A^), and (in 
the case of variance) the parameter k. These results appear to be new even for Ai = M d . 

(v) ( Variance asymptotics) For Poisson samples, a simlar result to Theorem 12.11 
holds (see Theorem 13.31 below) but with the limiting variance c 2 (C,k) modified to simply 
m 2 /(k — 3). Thus at least for Poisson samples, the limiting variance of the dimension 
estimator decreases with increasing k. 

2.3 Estimators of intrinsic entropy and volume content 

Renyi entropies. Let Y\ be as in Section 12.11 Given p > with p ^ 1, the Renyi 
p-entropy [H] of Yi, denoted H*(n), and the closely related Tsallis entropy (or Havrda 
and Charvat entropy [23]), denoted H p (k), are given respectively by 

H;(k) := (1 - p)- 1 log J p («); H p (k) := (p - l)" 1 (1 - J p (*)) , 

where I p (k) := ^ ]C ^(n(y)) p dy is at (I2.3p . When p tends to 1, H*(k) and H p (n) tend to 
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the Shannon differential entropy 

#i(«):=- / K(y)\og(K(y))dy. (2.15) 

J M 

Renyi and Tsallis entropies are used in the study of nonlinear Fokker-Planck equations, 
fractal random walks, parameter estimation in semi-parametric modeling, and data com- 
pression (see [T7] and the introduction of [29] for details and references). The gradient 
Yan p ^x{dH* / dp) equals (— l/2)Var[log «(¥].)], a measure of the shape of the distribution 
( |291 |4H] ) which also appears in the statement of Theorem 12.41 below. 

A problem of interest is to estimate the Renyi and Tsallis entropies given only the 
sample {^i}™ =1 and their pairwise distances. Here we show consistency results, variance 
asymptotics, and central limit theorems for non-parametric estimators of I p {k). The 
papers [I7J [2J [29J E7J HQ] consider estimators of I p {k) in terms of the k nearest-neighbor 
graph; here we restrict to k — 1, but this is for presentational purposes only. The 
approach taken here also yields consistent estimators of volume content. 

Recall that Ni(y, y) is the distance between y and its nearest neighbor in y. For all 
a G (— oo, oo) and finite y C M. d put 

yey 

For r G (0, oo), define the critical moment r c (n) G [0, oo] by 

r c (K) := sup{r > : E ||F 1 || r < oo}. (2.16) 

The next result spells out conditions under which ra _1 i? a (?T, 1/ ' m J ; n ) consistently estimates 
a scalar multiple of Ii- a /m( K ) m the L q sense. Let co m := 7r m//2 [r(l + m/2)]~ 1 be the 
volume of the unit radius m-dimensional ball. 

Theorem 2.2 If k G f c (M) and a G (0, oo), then 



1 R a {n 1 ' m y n ) u m a ' m T{l + -)/i_ a/m (K) as n -> oo (2.17) 

m 

with both L 2 and a.s. convergence. If instead k G P(A4) is bounded, and a G (— m/q,0) 
for q = 1 or q = 2, then holds with L q convergence. 



n 
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Putting m = a, we obtain consistent estimators of Iq{k), i-e., the m- dimensional 
content of the support of K. 

Corollary 2.1 If k E F C (M) then oo m R m (y n ) — > Iq(k) as n — > oo, with both L 2 and 
a.s. convergence. 

We now state variance asymptotics and a central limit theorem for R a (n l ^ m y^). 
Define V N " and 6 N ? by taking £ = N? in (1231)- (1231). 

Theorem 2.3 Suppose k G P c (M) is a.e. continuous, and a G (— m/2,0) U (0, oo). 
Then 

lim n- 1 Vax[K*(n 1 / m y n )] = a 2 (N?, k) := V N ? h. 2oi/m {K) - (5 N ? ' h- a /M) 2 (2.18) 

n— >oo 

and, as n — )■ oo 

n- 1/2 (i? a (n 1/m ^) -EJT'^X)) A N$,a 2 (N%,K)). (2.19) 
yUso, o*(N?,k) > and 

«j"f = (1 _ «/mX"/ m r(l + -) (2.20) 

m 

Remarks: (i) (Comparison of (2.11) with previous work) The limit (12.171) extends 



the law of large numbers limit theory for entropy estimators developed by Costa and 
Hero on manifolds [17J, who restrict to a G (0, m) and to compact manifolds. They 
extend the consistency results of Leonenko et al. (Theorem 3.2 of [29|), Theorem 2.1 
of Wade [17], Theorems 2.1-2.3 of [40j, and Theorem 2.4 of (38], all of which restrict to 
M=W m . In gQ| it is shown that if M = M m , then ( 12T7j) holds whenever a G (0, m/q), 
Ii- a /m( K ) < 00 an d r c (n) > qam/{m — qa). For M. = M m and k G F c (JA), under a 
Lipschitz assumption on k (along with an assumption on its gradient), Liitiainen et al. 
[31] develop a closed form expansion for the moments of R a , a > 0. 

(ii) ( Comparison of $2.18\) and $2.19\) with previous work) Existing central limit the- 
orems and variance asymptotics for the entropy estimators R a and the volume content 
estimator u m R m (e.g. Theorem 6.1 of [37]) assume that M. = W 71 and a > 0. The- 
orem [273] allows us to relax both assumptions. Subject to Var[i? a (n 1//m 34)] = G(n), 
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[T3] yields a rate of convergence in (I2.19p with respect to the Kantorovich-Wasserstein 
distance under minimal assumptions on the distribution of the Y^. 

Shannon entropy. The Shannon entropy H\(k) defined at (I2.15P is an information 
theoretic measure of how the data {Fi}" =1 is 'spread out'; low entropy implies that the 
data is confined to a small volume whereas high entropy indicates the data is widely 
dispersed. Accurate estimation of differential entropy is widely used in pattern recog- 
nition, source coding, quantization, parameter estimation, and goodness-of-fit tests; cf. 
the survey [6]. 

Shannon differential entropy is commonly estimated by first estimating the density 
k and then evaluating Hi(k ) where k is the estimated density; such methods involve 
technical complications involving bin-width selection for histogram methods and window 
width for kernel methods and are usually restricted to Ai = M. d . We bypass these 
technical issues and use only interpoint data distances to estimate entropy; the methods 
are thus applicable to general non-linear manifolds. This extends [271 129] , which restricts 
to Ai = M. d , and it lends rigor to the arguments in [321 133] . 

As in [27] [29] . we shall consider estimators of Hi(k) in terms of nearest neighbor 
distances. Put ip(y,y) := log(e 7 o; m iV{ n (7/, y)) where 7 := 0.57721... is Euler's constant 
(see e.g. [22]). 

For finite y put S(y) := *£ yey ^{y, y). Define by taking f = ip in 023]) . The 
next result provides the limit theory for the Shannon entropy estimators S^n 1 '™^). 

Theorem 2.4 Suppose k E P(M) and that either (i) k e F b (M) or (ii) M = W n and 
r c (/«) > 0. Then as n — >■ oo ; 

n^Sin^yn) Hx(k) in L 2 . (2.21) 

// k G F c (Ai) is a.e. continuous, then 

lim n^Var^n 1 /"^)] = a 2 (^, k) := - m~ 2 + Var[log /t(Fi)] > 0, (2.22) 

and 

n -i/2( S ( n i/my n) -ESin 1 ^)) ^M(0,a 2 &,K)). (2.23) 
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Remarks. The papers [27J[29] show that n~ 1 S(n 1 ' m y n ) consistently estimates Shan- 
non entropy when Ai = M. m , but they do not treat variance asymptotics, distributional 
results, or general manifolds. Theorem 12.41 redresses this. Subject to Var[5 , (ra 1 / m 3^n)] = 
Q(n), [13] yields (I2.23f) under minimal assumptions on the distribution of the Y{. 

2.4 Vietoris-Rips clique counts 

Let y C M d be locally finite and let j3 G (0, oo) be a scale parameter. The Vietoris- 
Rips complex TZ^(y), also called the Vietoris complex or Rips complex, is the simplicial 
complex whose /c-simplices correspond to unordered (k + 1) tuples of points of y which 
are pairwise within Euclidean distance (3 of each other. Thus, if there is a subset S of 
y of size k + 1 with all points of S distant at most from each other, then 5* is a k- 
simplex in the complex. The Vietoris-Rips complex has received attention in connection 
with the statistical analysis of high-dimensional data sets [H] , manifold reconstruction 
[15] . and gaps in communication coverage and sensor networks [HI [45], because of 
its close relation to the Cech complex of a set of balls (which contains a simplex for 
every finite subset of balls with nonempty intersection) . It has also received attention 
amongst topologists [12], who, given a data cloud y, allow the scale parameter (3 to 
vary to obtain homological signatures of the Vietoris-Rips complexes, which when taken 
together, yield clustering and connectivity information about 3^- The limit theory for 
the kth. Betti number of Vietoris-Rips complexes generated by random points in R d is 
given in [25l [26] . The general methods of this paper may be useful in extending these 
results to the setting of manifolds. 



Given y,{3, and k, let C k (y) be the number of /c-simplices (i.e., cliques of order 
k + 1) in TZ^(y). For example, 



l<i<7<n 



which is the empirical version of the so-called correlation integral 
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The quantity n. _1 C{ (y n ), with y n := {Yi}™ =1 and with Yj having unknown distribution 
/i, is the widely used sample correlation integral of Grassberger and Procaccia [21]. 
Grassberger and Procaccia [2TJ use least squares linear regression of logri -1 ^^^) 
versus log/3 to estimate the 'correlation dimension' of /z, i.e. the exponent when the 
correlation integral is assumed to follow a power law as (3 4- 0. The quantity n~ l C ( f\y n ) 
also features in estimators of the K function, as discussed in e.g. Chapter 8.2.6 of Cressie 

PS- 

The next result provides a law of large numbers and central limit theorem for the 
clique count n~ l C[ P \y n ) for any keN. Define h k : (R m ) fc+1 — )■ R by h k ( Xl , x k+1 ) := 
lli<i<j</fc+i l{\\ x i ~ x j\\ — 1}> i- e - the indicator of the event that x\, . . . ,x k +i are all 
within unit distance of each other. Given (3 G (0, oo), put 

Xl, . . . , Xj—i, X k+ i, . . . , X2k+l-j) 



J, 



k,j 



j\(k+l-j)\* 

dxi... dx 2k +i-j, (2.24) 



so in particular J k , k +i '■= J * • • / h(0, x±, . . . , x k )dx\ • • ■ dx k /(k + 1)!. Set 

:= a-209, «) := ^ J k>j f3 m{2k+1 - j) j - ((* + l)P mk J k , k+1 I k (K)) 2 . (2.25) 

Theorem 2.5 Let k be bounded on M. G M. For all k = 1,2, ... and a// /3 G (0, oo) we 

aai>e 

lim n _1 Cf } (n 1/m y„) = P mk J k , k+ ih+i^) in L 2 and a.s. (2.26) 

n— >oo 

///t a.e. continuous and if n G P?,(A^) ; tnen 

lim n-Varpf ^n 1 /" 1 ^)] = <rj>(/3, «) > (2.27) 

and, as n — > oo 

n~ 1/2 (Cjf\n 1/m y n ) -ECjf\n 1/m y n )) J\f(0,crl({3, k)). (2.28) 

Remark (Related work). Bhattacharya and Ghosh [8] used the limit theory of [/-statistics 
to obtain a central limit theorem similar to (12. 28ft for Poisson input, in the case where 
7W = R m and k is uniform on the unit cube. The limit (I2.26P extends the results in 
[34] (Proposition 3.1, Theorem 3.17) to non-linear manifolds whereas (I2.27P and (I2.28P 
extend Theorem 3.13 of [31] to non-linear manifolds. 
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3 General limit theorems 



As in Section EH1 £(y, y) is a real- valued functional defined on locally finite J'cR' 1 and 
y G y, and Yi, % > 1, are i.i.d. with density k. We shall say that £ is translation invariant 
if £(y , y) = £(z + y, z + y) for all 2; G M d and all (y, 30 . We say £ is rotation invariant if 
£(0, 3-0 is invariant under rotations of y, for all 3^- That is, we say £ is rotation invariant 
if £(0, 30 = £(0, A30 for all orthogonal m x m matrices A, where A3^ := :G 3^}- in 
this section we provide a general limit theory for the sums Y17=i £,( nl ^ m Y, {n 1 / m Y i }™ =1 ), 
namely Theorems 13.11 [3721 and 1331 We shall use these general results to prove the results 
in Sections I2T2H2T1 

We introduce a scaled version of £, one dilating the pair (y, y) by the factor A 1//m ; 
this scaling is natural when y d M. has cardinality approximately A and Ad is an 
m-dimensional manifold in M d . Thus, given fceN, for X, p G (0, 00) set 



Here we are allowing for a finite macroscopic cutoff parameter p because for some man- 
ifolds £ n (y,3n) may suffer from non-local effects even when n becomes large; see the 
counterexamples in Remark (i) of Section 12.21 

Given k G Z + and r > 0, let r) be the class of translation and rotation invariant 
functionals £ such that (i) for all y, y with card(3^ \ {y}) > k we have 



and (ii) for all n, Lebesgue-almost every (yi, . . . ,y n ) G (IR m ) ra (with M. m embedded in 
M. d ) is at a continuity point of the mapping from (R d ) n — > R given by 



W^) ■= ^ 1/m y^ 1/m y)HN k (y,y) < pY, 
My, y) ■= (x, k ,oo(y, y) ■■= £(A 1/m y, x 1/m y). 



(3.1) 
(3.2) 



£(y, y) = £(y, y n B max{r , Nkiyy)) {y)) 



(yi, ...,y n )^ £(0, {y u . . .,y n }). 



For finite y C A4, and for k G Z + , A G (0, 00) and p G (0, 00] define 





(3.3) 
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Recalling that y n := {yj}™ =1 and that V\ is a Poisson point process on M. of intensity 
Xn(y)dy defined at (12.41) . we now give a general law of large numbers for scaled versions 
of the linear statistics ( 11. ip when £ G E(fc, r). For 2 G Z + , let be the collection of 
all subsets of /C(«) of cardinality at most z (including the empty set). Consider the 
following moment conditions on £: 

supE\^ kt „(XiM\ p <oo, (3.4) 

n 

sup sup E|£ nifciP (?/,^U^)| p < oo (3.5) 

n>l,ve1C(K),AeSa (n/2)<£<(3n/2) 

(noting that 03. 5p implies (I3.4I) ). and 

sup E \£x,k,p(y, UA)\ P < oo. (3.6) 
\>i,yeic(K),AeSi 

Theorem 3.1 (Laws of large numbers for £ G E(k, r)) Lei M G M, k 6 F(M), k G Z + 
and p G (0, oo], and put q = 1 or q = 2. Let £ G H(fc, r) and suppose there exists p > q 
such that < \3.4\) holds. Then as n ^ oo we have L q convergence 



n- l Hl tKp {y n )^ I E[^0,n K(y) )]K(y)dy. (3.7) 

J M 

If also k G F c (Ai) and = or k G P c (.M), and i/ ao/ds /or some p > 5, then 
( ]X?D no/ds a.s. 

Given a > 0, let "H a denote a homogeneous Poisson process of intensity a in W" 1 
(embedded in R d ). Extending the earlier definitions (12. 5p and ( 12. 6p . we set 



Vt(a) :=E£(0,H a ) 2 + a / {E£(0, ?Q)Z(u, H° a ) - (E£(0, H a )) 2 } du 



(3- 



and 



5^a):=EaO,n a ) + a [ E [£(0, K) ~ U a )\du, (3.9) 
so in particular V^(l) = and 5^(1) = 5^. For all k G P(.M) and £, we define 

<t 2 (£,k):=/ Vt{K{y))K{y)dy-( I 6*( K (y))K(y)dy) , (3.10) 



provided that both integrals in ( 13. lOf) exist and are finite. 
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Theorem 3.2 (Variance asymptotics and CLT for £ G S(fe, r)) Lei 6 M and /ei 
k G P;,(.M) fre a.e. continuous. Let k G Z + ; r > 0, and p G (0, oo]. Assume k = or 
k G P C (.M). Lei £ G S(/c,r) and suppose that £ satisfies 0.51) and (Iff.fl) /or some p > 2. 
Then a 2 (£, k) < oo and 

lim n- l Y^{Hl Kp {y n )] = o 2 (£, «) (3.11) 

and as n — > oo, 

^ 1/2 (<^)-E<^ n ))AAf(0,, 2 (( lK )). (3.12) 

Remarks, (i) (Related work) Under a slightly different set of assumptions, Chatterjee 
[T3] provides estimates for the Kantorovich-Wasserstein distance between the distribu- 
tion of H^(y n ) and the normal, which imply a central limit theorem subject to the valid- 
ity of Vax[H^(y n )] = Q(n). Indeed, if in Theorem 3.2 we take p = oo, r = and p > 8, 
and if also o~ 2 (£, k) > 0, then combining (13. lip with Theorem 3.4 of [13] shows that the 
convergence (13 . 1 2[) is at rate (^(n 4 ^" 1 / 2 ) with respect to the Kantorovich-Wasserstein 
distance. 

(ii) (Simplification of mean and variance asymptotics for homogeneous £) The limits 
(13. 7p and (13. lip take a simpler form when there is some /3 G M such that £ is homogeneous 
of order (3, meaning that for all a G (0, oo) and all y G y C M m , we have £(ay, ay) = 
a^(y,y). In this case Z(0,H a ) = ^(O^-^n) = a-^ m £(0, U), so (CUD becomes 

n-'HUy^ /i^^E^O,^) (3.13) 

and similarly, by definitions (12. 5p . (12.61) . we can show that (13.81) and (13. 9p simplify to 

V*(a) = a~ 2 ^ m V^ and 5^(a) = a~^ m 5^ Hence, in this case (l3TTU|l becomes 

a 2 (£, k) = V*/i_ 2/3/m (K) - (^.^(k)) 2 . (3.14) 
If £ is homogeneous of order 0, we say it is scale invariant. 

(iii) (Limit theory for random measures on manifolds) Consider the point measures 

n 

Mkp^EW^Jnfe- (3-15) 
i=\ 
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where 5 y denotes the unit point mass at y. As in [5j |35j [36] , Theorem 13.11 admits an 
extension to the random measures (I3.15P as follows. Let B(M) be the space of bounded 
measurable real- valued functions on M. and for / G B(Ai), and fi a measure on A4, let 
(/,//) denote the integral of / with respect to /i. Put q = 1 or q = 2. Let £ G E(k,r) 
and suppose that there is a p > q such that (13 .4p and (I3.6P are satisfied. It can be shown 
that for all / G B(M), 

lim^- 1 (/,/i^ jP )= / f(y)E[aO,n K{y) )]K(y)dy in L q . (3.16) 

Similarly, if £ G S(k,r) satisfies the moment assumptions of Theorem 13.21 it can 
be shown that n~ l / 2 ({f,^ nkp ) — E (/, ir nk }) converges in distribution to a mean zero 
normal random variable with variance 

/ f(y) 2 vt«y))<y)dy-( I H<v))f(v)<v))*y) ■ 

JM \JM J 

We refer to [35] for details. 

(iv) ( Other junctionals) The approach also works for more general functionals than 
£ G H(/c,r). Along with the moment conditions already discussed, the key properties 
£ needs to satisfy are exponential stabilization and continuity, which we discuss in the 
proof and which represent £ being locally determined in some sense (cf. Remark at the 
end of Section [6]). Also, the approach works for marked point processes, where the points 
carry independent identically distributed marks. We would expect other functionals to 
satisfy this, as has been considered for a variety of functionals in the case Ai = M. m 

(v) (General random input) It would be of interest to consider input with a d- 
dimensional noise component, but we do not do so here. 

In (13. 7p we have presented the law of large numbers for binomial samples, but the 
same L q limit holds for functionals of the form A _1 if| k p {V\), with V\ as in (12.41) . 
Likewise, there is a Poisson analog to (I3.16p . 

We also have variance asymptotics and a central limit theorem for k p (V\), similar 
to Theorem 13.21 but with a different limiting variance. Moreover, in the Poisson setting, 
we have a bound on the rate of convergence to the normal, using the Kolmogorov 
distance. The result goes as follows. 
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Theorem 3.3 Let M E M and let k E ¥ b (M) be a.e. continuous. Let k E Z + , r > 
and p E (0, oo], and suppose k = or k E F c (M). Let £ E E(k,r) and suppose that £ 
satisfies $3.6}) for some p > 2. Then 

lim A^Var^, (V x )} = r 2 ^,n) := / V^y, K(y))n(y)dy < oo. (3.17) 



A— !> 

and as n — >■ oo ; 



A- 1/2 (^(^)-Ei7l fciP (P A 



A^(0,r 2 (e,K)). 



(3.18) 



Additionally, if ^ satisfies $3.6]) for some p > 3 and if a 2 (^,n) > 0, then there exists a 
finite constant C depending on d, k, k ; p ; and p such that for all A > 2, 



sup 



Var[tf* (P A )] 



< t 



< C(logA) 3m A- 1/2 . 



(3.19) 



As well as being of independent interest, Theorem 13.31 is used in our proof of Theorem 
13.21 Equation (13.191) is the counterpoint for manifolds to the rate of normal convergence 
result in [39]. Theorem 13.31 could be used to provide Poisson analogs to the results 
presented in Sections l2.2H2.4l 



4 Geometrical preliminaries 



The lemmas in this section, concerned with properties of manifolds, have no probabilistic 
content. The first of these relates distances in the manifold to distances in a chart. 

Lemma 4.1 Suppose (U,g) is a chart for Ai E M(m, d). Suppose F C g{U) is a 
compact subset of Ai. Then 

„ \\g-\z) -g-Hy)\\ Wg'H*) - g~Hy)\\ , x 

< inf ny Y — ,, yy,u < sup uy Y — n < oo- (4.1) 

y,zeF:y^z \\ Z — y\\ y,zeF:y^z \\ z ~ Vll 

Proof. Suppose (14.1 ft fails. Then by compactness, we can find a sequence (y n , z n ),n E N 
with y n E F, z n E F \ {y n }, and y n y for some y E F, such that setting u n := g~ 1 (y n ) 
and v n := g~ l {z n ), we have either 

IK -■"nll/ll^-2/nll ->• 00 as n ->■ oo, (4.2) 
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or 

\\v n — Wnll/H^n — Vn\\ ~ > as Tl — > OO. (4.3) 

Since g is an open map, the set g~ 1 (F) is compact. Hence \\v n — u n \\ remains bounded. 

Suppose (14. 2 p holds. Then \\z n — y n \\ — > 0, and hence z n — > y as n — > oo. Setting 
u := g~ l {y)i by continuity of g^ 1 we have u n — >■ u and n n — >■ w. Hence, arguing 
componentwise using the mean value theorem and the continuity of g', we have that 

\\z n -y n - g'{u){v n - u n )\\ = o(\\v n - u n \\) (4.4) 

and therefore since g'(u) has full rank, 

Hminf \\Zn-y n \\ = Uminf \W(u)(v n -u )\\ > Q) 
n-*oo \\v n — U n \\ n->oo \\y n — u n \\ 

which contradicts (14. 2p . On the other hand, if H4. 3|) holds we can show by a similar 
argument that limsup^^dl^n — J^ll/H^n — u n\\) < oo, again giving a contradiction. □ 



Recall that )C(k) denotes the support of n, and if k G ¥ c (Ai), then /C(k) is a 
compact C 1 submanifold-with-boundary of Ai. For w E E C M. and r > 0, let 
Bf (w) := £ r (w) fl E. Recall that w m := vr m / 2 [r(l + m/2)]" 1 is the volume of the ball 
£i(0) in R m . 

Lemma 4.2 Suppose k G P c (jM). Suppose y^ G K,{k), and suppose for n G N we are 
given y n G JC(k), r n > 0, a n > wift y n — > t/oo, r n — >■ and a„ — > as n — >■ oo. Then 



and 



limsup ( r n m / dy \ < u m , (4.5) 



hminf(r- m / ] > ^, (4.6) 



n— >oo 



and putting s n = r n (l — a n ), we aave 

dy 1 < oo. (4.7) 



limsup ((c-cr 1 / 



)\B#GhO 
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Proof. Let (U,g) be a chart such that G U and g(0) = y^. Assume also the chart is 
chosen so that U fl <7 _1 (/C) D U fl (M + x M m_1 ) (with equality if is on the boundary of 
£(«)). Set t/ n = g^iB^iyn)) C [/, and set K = g- 1 {B^{y n )). Let £ denote Lebesgue 
measure, and note that by continuity sup ugC/n |(D g (w)/D s (0)) — 1| vanishes as n — > oo. 
Thus there exists n such that for n > n we have B^ /l (y n ) C and by (12. 2p . 



L 



dy = D g (0) [ (D g (u)/D g (0))du ~ D g (0)£(U n ) (4i 



and 



/ dy = D g (0) [ (D g (u)/D g (0))du ~ D g (0)£(U n \ V n ) (4.9) 

JB^(y n )\B^(y n ) Ju n \V n 



and 



rfy> / D g (u)du~ D g (0)£(U n n(R + xR" 1 - 1 )), 

JUnnCR+xR™- 1 ) 



where the asymptotics are as n — > oo. Given n > n , set u n := g~ 1 {y n ) £ f ■ We claim 
that 

limsup sup r~ l \\g'(0)(v - iz n )|| < 1; (4.10) 
liminf inf r^H^O)^ - u n )\\ > 1. (4.11) 

n-voo v£U\U n 

To see (I4.10p . take t> n G Z7 n for nGN. By continuity of g -1 , we have v n — > as n — > oo 
so by applying the mean value theorem and using continuity of g', as in ( 14. 4 p we have 
that Hfl'(uri) — Vn — 9'{0)(v n — u n )\\ = o(\\v n — u n \\) as n — > oo. Therefore since g'(0) has 
full rank, 

\\g(v n ) - y n \\ ~ g'(0)(v n -u n ) as n -)> oo. (4.12) 

Then (I4.10p follows because ||^(v n ) ~ Un\\ < r n and the choice of v n G U n was arbitrary. 

To see KT\\ . let v n eU\U n with w n as n -> oo. Then ( H~T2]) holds by the 
same argument as before, but now ||g(t> n ) — y n \\ > r n and (14. lip follows since the choice 
of v n G U \ U n was arbitrary. 
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By (14.1 Op . given e > we have for large enough n that 

C(U n ) < £({v G R m : \\g'{0){v - u n ) || < r n (l + e)}) = (1 + e) m cu m r™/D g {0), 

and (14. 5 p follows by (14. 8p . By (14. lip , given e > we have for large n that 

C(U n n (R+ x l" 1 - 1 )) > ^£({t; G M m : ||</(0)(<; - w n )|| < r n (l - e)- 1 }) 

= l(l-e)- m u m r™/D g (0), 

and (14.61) follows. 

Finally we prove (14. 7p . Since (1 — a)" 1 < 1 — a for all a e [0,1], for large n we have 
C - s n = C(l " (1 - «n) m ) > Ca n . (4.13) 

We claim that if (v n ) nE ^ is a sequence in [/ with ||g(t> n ) — y n \\ = r n then setting w n := 
m„ + (1 — 3a n )(v n — u n ), we have ||j(ti; n ) — y n \\ < s n for large enough n. Indeed, for any 
such sequence by the mean value theorem we have r n = \\g(v n ) —y n \\ ~ ||5 , '(0)(t' n — u n )|| 
as in (I4.12p . and also 

\\g{w n ) ~ g{v n )\\ ~ 3a n \\g'(0)(v n - u n )\\ ~ 3a n r n . 

Moreover g(w n ) — y n and g(v n ) — y n are almost in the same direction, so that for large 
n, \\g(w n ) — y n \\ < r n (l — 2a n ), and the claim follows. 

By the preceding claim, there is a constant C such that the thickness of the deformed 
annulus U n \ V n in all directions is bounded by Ca n r n , so by using polar coordinates we 
have C{U n \ V n ) = 0(r™a n ), and then using ( 14. 9ft and ( 14. 131) we have that 

/ ^ = 0(r> n ) = 0(C-C), 

JBM( yn) \ B M iyn) 

demonstrating (14.71) . □ 
Given M G M and K G F b (M), set A(«) := diam/C(fi;) := sup{||a;— y\\ : x,y £ )C(k)}. 



22 



Lemma 4.3 Suppose k G F c (Ai). Then there is a constant Cq G (0, oo) such that for 
all r G (0, A(k)] and w G /C(k) we have 

C Q l r m < [ dy< [ dy< C r m . (4.14) 

There are also positive finite constants C\ and p\ such that if < s < r < p\ and 
w G JC(n) then 

[ dy< d(r m - s m ). (4.15) 

Proof. In the proof, set /C := K(k) and A := A(/c). Suppose (14. 14ft fails; then there 
must be a (K. x (0, A])-valued sequence {(y n , r n ), n G N} such that either 



n— ¥oo 



lim r~ m / rf2 = (4.16) 



or 



n— >oo 



lim r n " m / rf2 = oo, (4.17) 

<BM{y n ) 



Since /C x [0, A] is compact, by taking a subsequence we may assume without loss 
of generality that y n — > y and r„ 4 r for some y G /C and r G [0,A]. If r = then 
dUED would contradict gSJ and (14TT71) would contradict (jOi . If r > and (OEJ) 
holds, then since Bfj 2 {y) C B^ n (y n ) for large enough n, we would have J sK ^ c?2; = 0, 
which is impossible. If r > and (14.171) holds, then since B rn (y n ) C B 2r (y) for large rz 
we have J Ba , , rf^; = oo, which is also impossible: indeed by compactness B 2r (y) PI M. is 
covered by finitely many of the regions gi(Ui), and f g .nj.\ Ki{x)dx is finite for all z (we 
may assume the charts were chosen so all the regions Ui are bounded). Therefore we 
have a contradiction so (I4.14p must hold. 

It remains to prove there exists positive p\ such that (I4.15P holds for all w G K, and 
< s < r < p\. Suppose this is not the case. Then there is a sequence {(y n , r n , a n ),n G 
N} taking values in K x (0, A] x (0, 1) such that r n — > 0, and setting s n = r n (l — a n ) we 
have 



n— >oo 



lim (C - s™)" 1 / d y = oo. (4.18) 

'BM(y n )\ B M(y n) 
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By taking a subsequence, we may assume that y n — >■ y for some y G /C, and either a n is 
bounded away from zero or a n — >■ as n — > oo. 

If inf ne N{a n } > 0, then (r™ — s™)' 1 = 0(r~ m ) so (14. 181) would give a contradiction 
of (14. 14ft . If a n -)■ and r n ->■ 0, then (14. 18j) would give a contradiction of (14 . TP . □ 

5 Weak convergence lemmas 

For all d G N, we put a topology T := 7d on locally finite point sets in M. d . As in 
Aldous and Steele (pQ, page 250), we adopt a topology whereby a sequence of locally 
finite point sets (y n ) n >i converges to a locally finite y, if and only if (i) it is possible to 
list the elements of y as a possibly terminating sequence (yi,i > 1) and the elements of 
y n as a possibly terminating sequence (y n> i,i > 1) in such a way that 

lim y n>i = y u Vi (5.1) 

n—toa 

and (ii) for any L with no point of y on the boundary of Bl(0), we have 

lim card(y n n B L (0)) = card(y n B L (0)). (5.2) 

We would like to know that whenever point sets y n C M. d are close to the point set 
y in the topology Td then £(y,y n ) is close to £(y,y),y G This motivates Definition 
15. II below. Recall that H denotes a homogeneous Poisson point process of unit intensity 
on R m . 

Definition 5.1 £ is continuous if for any linear F : R m — > M. d of full rank, for almost 
all z G W m both F(7i) and F(1-L z ) lie a.s. at continuity points o/£(0, •) with respect to 
T d . 

If U n and U are simple point processes (i.e. random locally finite point sets in 
M. d ), then, following the discussion in [T] page 251, we shall say that U n converges in 
distribution to hi if the law of U n converges weakly to that of hi under the (metrisable) 
topology Td, which is the same as the notion of weak convergence of point processes 
discussed in Daley and Vere- Jones [!§] . Section 11.1. 
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Given the atlas ((Ui, gi), i G X), for i G X define the function : C/j — > [0, oo) by 

= K(gi(x))D g .(x), x G C/j. (5.3) 

By using (12. 2 p with a partition of unity for which ipi = 1 on gi(Ui), we see that for Borel 
BCUt, 



K,(y)dy = / K(gi(x))D gi (x)dx = / Ki(x)dx. (5.4) 
Si(B) 7b ./b 

Given a > and x G C/j, by g'i(x)(T-L a ) we mean the point process in M. d obtained by 
applying to % a the linear map g[(x). Similarly, g' i (x)(z) is the image of z under the map 
g'i(x). If A4 G M(m, d) or .M is an open subset of M m , and / : Ai — > R is measurable, 
then we say w e M is a Lebesgue point of / if £~ m f B i w )nM I/O/) ~ f( w )\dy tends to 
zero as e 4- 0. 

Lemma 5.1 Suppose i G X, Ui is bounded, and u G Ui is a Lebesgue point of hi. 
Suppose £(n), n G N is a sequence of integers with £(n) ~ n as n — )■ oo. S'et ?/o = 5'i( M )- 
T/ien as n —7- oo we /iave fin i/je above sense of convergence of point processes in M. d ) 



n 



l/mi 



Vo + yt(n)) — > 9'i{u)(V, ki{u) ). (5.5) 



Proof. By taking B = Ui in (15 .4p . we see that j v ki(x)dx < 1, so that hi is a 
(possibly) defective density function on Ui. Extend hi in an arbitrary manner to a 
probability density function on R m . 

Let X n be a point process in M. m consisting of £(n) independent identically distributed 
random m-vectors X^i, . . . , X^n) with density hi. Then 

ft^wn^D^n^ (5.6) 

because for Borel 5 C (EHJ) shows that P\Y X G = P[X 4jl G £]. 

By Lemma 3.2 of [36] (restated as Lemma 3.2 of |35j), the distribution of the point 
process n x l m (—u + A^ )Tl ) converges weakly to that of 'Hf H {u)-> m the metric of [36], which 
is not the same as the one we are using here (as discussed in [36] , the metric in [36] is 
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complete but not separable). We claim that if / : R m — > M is measurable with bounded 
support, then 

lim E exp I v 73 ! f(x) = E exp I V^T ^ f(x) . (5.7) 

\ xgnVm / \ ^WiSjW / 

Indeed, by the proof of Lemma 3.2 of [35] there is a coupling in which the random 
variables under the expectations on the left and right hand sides of (15. 7p are equal with 
probability tending to 1. 

It follows from (15. 7p that for any finite collection of bounded Borel sets Aj (1 < j < k) 
in M m , the joint distributions of the variables card(n 1 / m (— u + X n ) fl Aj), 1 < j ' < k, 
converge to those of the variables cardf^^^) D Aj), 1 < j < k. Hence, since Ui is a 
neighborhood of u, the joint distributions of the variables card(n 1 /" 1 (— u+(X n nUi))(lAj) , 
1 < j < k, converge to those of the variables card("H£(u) H Aj), 1 < j < k. 

Therefore the point processes n 1 ^ m (—u + (X n (~)Ui)) converge weakly to Hk^u), m the 
sense discussed at the start of this section (see Theorem 9.1. VI of |19j). 

Now we argue as in ([1], page 251). By the Skorohod representation theorem, we 
can choose coupled point processes X n and T-L^u), all on the same probability space, 
such that X n has the same distribution as X n C\Ui, and %K,du) nas the same distribution 
as Ha^u), and such that n 1 / m (— u + X n ) converges almost surely to Ha^u)- That is (see 
(15.1 p and (15. 2p ). we can list the points of H^u) as Xi,x 2 , ■ ■ ■ and the points of X n as 
x n ,x, x n ,2, ■ ■ ■ , Xn,N n , in such a way that for each j we have almost surely 

n 1/m (x nJ - u) ->• Xj (5.8) 

and for any L > with no point of Ha^u) in -Bl(O), we have almost surely 

card(n 1/m (-u + X n ) n B L (0)) ->• card(?4 i(u) fl B L (0)). (5.9) 

By (I5.8p . x nj - — >• u as ri — >• oo, so by the differentiability of gi, we can write 

9i(x n ,j) - 9i(u) = g'^ixnj -u)+ w nJ , (5.10) 

with ||u> n ,j|| = odl^nj — u\\) so n l / m w n j — > as n — > oo. By (15. 8p and (I5.10p . 

n l/m (gi(x n j) - gi(u)) ^(w)^) as n -» oo. (5.11) 
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We claim that the point process n 1//jn (— gi(u) + gi(X n )) converges a.s. to g[{u)(^^ u )). 
The condition corresponding to (15. ip follows from (15. lip . To demonstrate the condition 
corresponding to (I5.2p . set y :— giiu) and let L be such that no point g' i {v){xj) lies on 
the boundary of -Bl(O). We need to show that 

lim card[n 1/m (-y + 9i {X n )) n B L (0)] = card^u) n B L (0)]. (5.12) 

n— >oo 

Choose 5 > such that B 2 s{y) C gi{Ui). By Lemma [4.11 we may define finite by 

JT:= sup ll^OO-^ll/lk-*!!- 

Let if' > and suppose x E Ui with ||x — it|| > n^ 1 ^ m K'L. By definition of i^, if 
g(x) E Bs(y) then n^ m \\g(x) — y\\ > L, and this also holds if g(x) Bg(y), provided 

Sn l/m > L 

Hence, for n large the contribution to the left side of ( 15.12)) comes only from x E 
-i/™ K r L (u). For large enough n, the set of such x consists precisely of those x n j 
such that Xj E Bk>l{0), provided K' is chosen so that no point of Hk^u) lies on the 
boundary of Bk>l(0). 

By (15. lip , for large enough n the set of j such that x n j contributes to the left side 
of (15 .7p is precisely those j such that g'(u)(xj) E B L (0). Thus we have (I5.12p . and 
therefore n l / m {— y + gi(X n )) converges almost surely to ^(ifK'H/^fa)) as claimed. Hence 
n 1 / m (— y + gi(X n H Ui)) converges in distribution to ^'(u)^^^). Together with (15. 6 j) 
this yields 

n i/m(_ y + ( %(B) n g^Ui))) A g' t {u){U' ki{u) ). (5.13) 

By again using Lemma 9.1. VI of [19] (equivalence of weak convergence and convergence 
of fidi distributions) and the fact that gi(Ui) is a neighborhood of y in A4, we can deduce 
that (I5.13P still holds with 3^(n) ^9i(Ui), replaced by on the left hand side, so that 
(15. 5 p holds as asserted. □ 

The next lemma is a two-dimensional version of Lemma 15.11 



27 



Lemma 5.2 Suppose 1 el and 2 G X ; and gi(Ui) fl g 2 (U 2 ) = awe? a/so ?7i fl f/2 = 
and f/i U f/2 bounded. Suppose that for % — 1,2, Xj G £7j a Lebesgue point of hi, 
and set yi = gi(xi). Suppose (£(n),n E N) is a sequence of positive integers such that 
£(n) ~ n as n — >• 00. T/ien «sn-} 00, 

h 1/m (-2/l + ^(n)),W 1/m (-y2 + ^(n))] A ^(Sl)^^)).^^)^^))]. ( 5 - 14 ) 

where % a here is an independent copy ofH a . 

Proof. By ( 15. 4ft we have Ki(x)<ir + k 2 (x)dx < 1. Since we assume C7i fl C/2 = 
and U\ U f/2 is bounded, we can therefore find a probability density function k on W 71 
which is an extension of both R\ and k 2 , i.e. with ) for x G Ui, i G {1,2}. 

Let X n be a point process in W 71 consisting of n independent identically distributed 
random m- vectors X\, . . . , X n with density k. Then 

(gi 1 (ye(n ) ng 1 (u 1 )),g^(y n ng 2 (u 2 ))) = (x e(n) nu u x i(n) nu 2 ) (5.15) 

because for % = 1, 2 and Borel 5 C [/;, (E3D shows that P[Y1 G g^B)] = P[Xi G B]. 

By Lemma 3.2 of [36] (restated as Lemma 3.2 of [35]), the joint distribution of the 
point processes n l / m {— Xi+X^), vt}l m [— x 2 + X^), converges weakly to that of "H^^i), 
% 2 (3: 2 ), in the metric of [36]. We can then follow the proof of Lemma ISTTl with straight- 
forward modifications to deduce ( 15. 2ft . □ 

Before proceeding we shall re-express the Poisson processes appearing in the limits 
(15. 5p and f !5.14j) in a manner that is intrinsic to Ai, i.e. not dependent on the choice 
of atlas. Recalling that Gr m (d) is the Grassmannian, given Ai G M and y G Ai, let 
TyM. G Gr m (d) be the hyperplane tangent to — y + Ai at 0, i.e. the image of M m 
under the linear map g[{x) when x G W 1 and (Ui,gi) is any chart such that x G Ui 
and y = gi(x). We normalise the Lebesgue measure on T y AA (with volume element 
denoted du) in such a way that for any orthonormal basis of the subspace, the set 

{Si^=i a «/i : < aj < 1} has unit Lebesgue measure. For y,z £ Ai, let H' ytK r y ) denote a 
homogeneous Poisson point process on T y Ai with intensity n(y), and let T-L' z K ^ denote a 
homogeneous Poisson point process on T z Ai with intensity k{z), independent of %' y ,y 
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Lemma 5.3 Suppose z\ G M. and z 2 G M. are distinct Lebesgue points for k. Suppose 
(£(n),n > 1) is a sequence of positive integers with £(n) ~ n as n —¥ 00. Then 

[n^ m (-z 1 + y e{n) ),n 1 / m (-z 2 + y e{n) )] A {U' ZlMzi) ,U' Z2Mz2) ). (5.16) 

Proof. It is easy to see that we can choose our atlas (Ui,gi)i^x such that Zi G QiiUi) for 
i = l,2, and such that moreover gi{U\) D #2(^2) = 0, and Ui PI U2 — 0, and f/i U U2 is 
bounded. Let Xi := g~ l (zi) for i = 1,2. Then by Lemma [5721 to prove (15.161) . it suffices 
to demonstrate for i = 1, 2 the distributional equality 

^(^)(^(^))=K,^)- ( 5 - 17 ) 

Let z = 1 or i = 2. By the Mapping Theorem on p. 18 of [28J, 5^(2^) ("Hs^)) is a Poisson 
process on the linear space g'^x^iR" 1 ) with intensity measure [i where fi(B) is Kj(xj) 
times |(5f 4 / (xj)) _1 (5)|, where | • | denotes the m-dimensional Lebesgue measure. 

Recall from Section [2711 the definition of D g .(x). For bounded measurable A C M m , 
it is a fact from linear algebra that 

1^)^)1=^(^)1^. (5.18) 

Indeed, the columns of the Jacobian matrix J gi ( the images under g'(xi) of the 

standard basis vectors of M m so (I5.18P clearly holds when the standard basis vectors 
map to an orthonormal system, but then it can be deduced in the general case using 
standard properties of determinants. Equation (15.181) is the basis of the formula (12. 2 h 
given earlier. 

By flEUD, if A = {glixi))' 1 ^), then \B\ = D gi { Xi )\A\ so that n(B) = R^x^/ D gi { Xi ) 
so by O, n(B) = K(zi)\B\ and fl5Tfl) follows. □ 

Next we give weak convergence results for £. Recall that is the origin of W m . The 
next lemma is an analog of Lemma 3.6 of [35] and Lemma 3.6 of [36] . 

Lemma 5.4 Suppose £ is continuous in the sense of Definition 15.11 and rotation in- 
variant. Let y G M. and z G A4 be a pair of distinct Lebesgue points for k with K,(y) > 
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and k(z) > 0. Suppose (£(n),n > 1) is a sequence of positive integers such that lin) ~ n 
as n — >■ oo. Let k G Z + and p G (0, oo]. Then as n ^ oo we have 

[in,k,p{y,yi{n)),in,k,p{ Z ^yt{n))] ► [£(<>, %c(y)) , f(0, U K {z))\ • (5-19) 

Also, if we choose a chart (U,g) and u G U such that y = g(u), then for almost all 
(fixed) x G M. m , setting v n := u + n _1 / m a; we have as n — >■ oo t/iat 

Wv. ^ WsK), 3^ (B) ) ^> e(o, ?C (3/) ). (5.20) 

Proof. First suppose p = oo. Then the left hand side of (15.191) is equal to 

[£(0, n^i-y + 3V))U(0, ^(-z + y e(n) ))]. 

Also, by rotation invariance the right hand side of ( I5.19P has the same distribution as 
[£(0, H' y K (y\), £(0, H! z K t z \)]- Therefore, under the assumptions given, (15.191) is immediate 
from Lemma 1531 and the Continuous Mapping Theorem ([H], Chapter 1, Theorem 5.1). 
Next we prove (I5.20p in the case p = oo. By (13. 2 j) and translation invariance of £, 

MV, y 9 e (n?)U9(Vn), y\ n) ) = C(0, n 1/m (-y + %(„)) U + g(v n ))}) 

x^n 1/m (-y + g{v n ))^ m {-y + y l{n) ) U {0}) 

= F(nV m (-y + ffW),^-!/ + 3* (n) )) (5.21) 

where for any y G lR d and any locally finite y C M d we set 

y) := £(0, y U {y})£(y, Y U {0}) = £ (0, y U {y})£(0, -?/+(yU {0})). 

By definition, n}l m {v n — u) = x, and by the same argument as for (15.111) earlier on, 
n 1 ' m (g(v n ) — y) converges to g'(u)(x) as n — > oo. Combining this with Lemma [5.1[ we 
have by (15.171) the convergence in distribution 

(n 1/m (g(v n ) -y),n 1/m [-y + y iin) }) ^ (g'(u)(x),U yAy) ). 

By the continuity assumption, for almost every x the point set %' y K , y ^ U {g'(u)(x)} is a.s. 
a continuity point of £(0, •) and so is the point set — g'(u)(x) + (H' y U {0}). Thus, 
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(g'(u)(x), Hy ,s) is a.s. at a continuity point of F, for almost all x. Hence we have the 
desired convergence in distribution (15.201) (when p = oo) by (15.211) and the Continuous 
Mapping Theorem. 

Finally, we consider the case with < p < oo. It is easy to see that 

lim P[^ n ,k,p(y^yt(n))^n,k,p( Z ^yt(n)) ^ £n(v, 3>/(n))fnfo 3^(n))] = 0, 
n— >oo 

so the general case of ( 15.191) follows from the special case with p = oo (already proved) 
along with Slutsky's theorem. The proof of the general case of (15.201) is similar. □ 



6 Proofs of Theorems 3.1-3.3 



Rx(y,y)-- 



We first give some definitions. Assume M 6 M and k G are given, and set 

/C := K,(k). We adapt to the manifold setting the definition of exponentially stabilizing 
functionals [5J [35]. Suppose k G Z + , r G [0, oo) are given, along with the density k. For 
y G fC and locally finite y C JC define 

max[r, N k (X 1/m y, X 1/m y)] if card(^ \ {y}) > k 

A 1//m diam(/C) otherwise. 

Thus if k = then R\(y, y) = r. 

It is easy to see that R := R\(y,y) serves as a radius of stabilization for any £ G 
S(A;,r), in the following sense: for all finite ^4 C (JC \ B x -i/ mR (y)), we have 

a (y, (y n B x - ymR (y)) uA)=Cx (y, y n B x - ymR { v )) . (6.1) 

For all G Z + ,p G (0, oo), note that i? also serves as a radius of stabilization for ^\,k,p 
in the sense that ( 16.1 ft holds if £a is replaced by £A,fe,p- Recall the definition of point 



processes 34 and Pa in Section 12.11 an d recall that S2 is the collection of all subsets 
of JC(k) of cardinality at most 2, including the empty set. Given e > and t > 0, 
we define the tail probabilities for i?A denoted r(£) and r e (t), for Poisson and binomial 
input respectively, as follows: 

r(t) := sup ess sup P[R\(y, V\) > t]; 

A>1 yEK 

r £ (t) := sup ess sup P[R\(y, y n U A) > t] 

A>l,n6Nn((l-e)A,(l+e)A), A&S 2 y&IC 
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where the ess sup denotes essential supremum with respect to the measure K(y)dy. 

Definition 6.1 Given k andr, we say that all £ G S(k,r) are exponentially stabilizing 
for k z/limsup t _^ 00 t _1 logr(i(:) < 0. We say that all £ G S(/c,r) are binomially exponen- 
tially stabilizing for k if there exists e > such that limsup^oot -1 logr e (t) < 0. 

We next show that functionals in E(k,r) have the continuity property of Definition 
15.11 as well as the binomial and exponential stabilization properties. 

Lemma 6.1 Let k G Z + and r > 0. Then every £ G S(k,r) is continuous. If either 
k = or k G F C (A4), then every £ G E(k,r) is exponentially stabilizing and binomially 
exponentially stabilizing for n. 

Proof. To prove continuity, let £ G r), let z G R m , and let F : W 71 — > M. d be linear 
and of full rank. Assume that the points of F('H Z ) have distinct Euclidean norms, and 
that for all n G N there are no points of F(1-C) on the boundary of the ball B n (0). List 
the elements of F(7i) in order of increasing Euclidean norm as as Xi,X2, ■ ■ ■■ Suppose 
(yn)nGN is a sequence of locally finite point sets in M d converging in T to F(l-L), and list 
the elements of y n in order of increasing Euclidean norm as y n ,i,yn,2,yn,3, ■ ■ ■ (possibly 
a terminating sequence). 

Given the realization of H, we pick the smallest K G N such that 

K>mzx(r,N k (0,F(n)),\\F(z)\\). 

Let N denote the number of points of F{l-L) in F>k, and assume (yi, . . . ,y^) lies at a 
continuity point of the mapping (xi, . . . , x^) £(0, {xi, . . . , x^})- By the convergence 
of y n to H a , for all large enough n we have y n ^ G B K and y n ,N+i £ B K , and moreover 
Vn,j — > Uj as n — > oo for all j < N. Therefore by the continuity assumption we have 

f (0, F{H a )) = £(0, {y u . . . , 2/at}) = lim £(0, {y n ,i, • • • , Vu,n}) = lim £(0, F(yn)). 

Similarly, if (-^(-z), j/i, . . . , 2/jv) lies at a continuity point of the mapping (x , £i, . . . ,x^) h > 
£(0, {x , a?i, . . . , xtv}) and if (v n ) ragN is any sequence of locally finite point sets in M. d con- 
verging in T to F(7i z ), then £(0, F(?i*)) = lim^oo ^(0, v n ). Thus, £ is continuous. 
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We prove the exponential stabilization of £ G H(&, r), i.e. the uniform exponential tail 
bound for R\(y,V\), as follows. Suppose y G /C, A G [1, oo) and r < t < A 1//d diam(/C). 
Then 

P[i? A (y,P A ) > t] = P[A r A : (A 1/m ?/, A 1/m "P A ) > t] 

and the last event occurs if and only if the number of points from V\ in B tx -i/ m (y) \ {y} 
is less than k. The number of such points is Poisson distributed with parameter a(t) : = 
a(t, y, A) equal to the Ak measure of B tx -i/ m (y) fl Ai. By Lemma H31 there is a constant 
C*2 > such that we have uniformly in A G [1, oo), y £ JC, and t G (0, A 1 / m diam(/C)) that 
a(t) > t m . Thus by a Chernoff bound for the Poisson distribution (see e.g. Lemma 
1.2 of [34]), there is a constant C 3 such that for max(r, (2kC 2 ) 1/m ) < t < A 1/m diam(/C) 
we have 

P[R x (y,V x ) >t}< kexp(-C^t m ), 

and moreover this also holds for t > A 1 / m diam(/C) since P[R\(y,V\) > t] = in this 
case. This gives the desired exponential stabilization of £ for Poisson input. Modifica- 
tions of this argument yields the exponential stabilization of £ with respect to binomial 
input. □ 

For finite y C R d and y G y, and k G Z + , n G N, p G (0, oo], define 

CUpO/^) := W^3WIW^)I < ^ 5A2 }; ^,,00 := EOJ). (6-2) 

yey 

Recall the (similar) definition of k (y) at (13. 3p . Given n, i, v G N with i < u, define 

Gi,u,n := Hl Kp (y v ) - H*jjy v \ {F,}), (6.3) 

G*, V!n ■= K Ap (y u ) - K Ap {y v \ {r,}). (6.4) 

Lemma 6.2 Suppose £ is binomially exponentially stabilizing, and k G P&(A^). Suppose 
h{n)/n —7-0 as n oo and suppose for some p G N i/ia£ (Iff. 51) holds. Then 

limsup sup E |G I , > „ ) „,| P < oo (6.5) 

n->oo n-/i(n)<i/<n+ft(n) 
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and 

limsup sup E \G* un \ p < oo. (6.6) 

n— >oo n—h(n)<v<n+h(n) 

Proof. We prove only ( 16. 5P ; the proof of ( 16.61) is virtually the same. Write £ n for £ nt k )P , 
and define A a £ n (y, y) := £ n (y, y U {x}) — £ n (y, y). Putting Y = Y v , observe that 

u-l 
i=l 

The pth moment of the first term in the right hand side of (16 ,7p is uniformly bounded 
by (13.51) . The pth moment of the sum in the right hand side of (16.71) is given by 

(u - i)e |A y e„(ri, y„)\ p + (u-i){u- 2)e \a ¥ uyi, ^Ma^qs, y v )\ + ■ ■ ■ 

+ ( (, ( -l-t)! ) E n |Aye " (r " y - )| - (6 - 8) 

Let X, yi, 8^ Ui be as in Section 12.11 Using compactness, let X C X be a finite set such 
that JC C U ieIo BsXyi), and set S := min ie i ((5i). Then 

i/E|A Y £n(n,;K,-i)| p = / / E|A»e„(^^- 2 )|^«(«)^«(y)d2/. (6.9) 

For z,y £ K. with ||z— y|| > 5, by binomial exponential stabilization we have P[A y ^ n (z, y*. 
0] decaying exponentially in n 1//m , uniformly over such (z, y) and over v G [n — h(n),n + 
/i(n)]. By this, the bound (13.51) . and Holder's inequality, the contribution to (16.91) from 
such (2, tends to zero as n — > oo and is uniformly bounded. 

Now take y G .M, and choose i G X such that ?/ G Bg.(yi). Then B 2 s(y) C g%{Ui)- 
Assume without loss of generality that <^(0) = y and let L^' := g^ 1 (Bs(y)). Then the 
contribution to the inner integral in the right hand side of ( 16. 9p from z G B$(y) can be 
rewritten as the expression 

v \ E\AyU9e(n),yu-2)\ p Ki{u)du= [ E \ A y ^ n (g £ (iy' 1/m v),y^ 2 )\ p Ki{i'~ 1/m v)dv 

and by binomial exponential stabilization, and ( 13. 51) . and Holder's inequality and compa- 
rability of norms in Z7j and in g%{Ui) (see Lemma l4~Ti) . there is a constant C, independent 
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of y, such that the integrand is bounded by C exp(— C~ l \\v H 1 /" 1 ), which is integrable in 
v. This shows that ( 16. 9 p is uniformly bounded. 

Turning to the second term in (16.81) . we condition onY = y with i G Tq again chosen 
so that y G BsXlJi)- By Holder's inequality we have 

z/ 2 e \a v myi, y v )\ p - x \A v UY 2 , y v )\ 

= v 2 I [ E \A y £ n (w, ^_ 2 )| p-1 |A^„('u;, y*_ 2 )\K(w)K,(z)dzdw 

JK JK 

< v 2 [ [ (E|A^n(^,^_ 2 )| p ) (p - 1)/p x (E\AyUz,y:-2)\ P ) 1/p KM<z)dzdw. 

JK JK 

The contribution to the last expression from (w, z) Bg(y) xBg(y) is uniformly bounded 
(and in fact tending to zero as n — > oo) by a similar argument to the one given above 
for the contribution to (16.91) from w ^ Bg(y). Assuming ^(0) = y, the contribution to 
the last integral from (w, z) G Bg(y) x Bg{y) is given by 

f [ (e \Ayu9i(v~ 1/m v), y*-2 u {^(^ 1/m ^)})| p ) (p - 1)/p 

Jv 1 / m U' i Ju 1 / m U' i 

x{E\AyU9^- l/m v)^-2^{9i{y- 1,m u)})^ 

By binomial exponential stabilization, Holder's inequality and comparability of norms 
in Ue and in ge(Ue) (see Lemma [4. If) . there is a constant C such that the integrand is 
bounded by C exp (-C-W\u\\ 1/m + |M| 1/m )), which is integrable in (u,v). This shows 
that the integral is uniformly bounded, and hence the second term in (16.81) is bounded. 
The remaining terms in (16. 8ft are handled similarly, showing that all terms in (16. 8ft are 
uniformly bounded and hence by (16.71) . we have (16.51) . □ 

Proof of Theorem \3.1\ We first sketch the proof for q = 2. Using (15.191) we obtain, 
as in the proof of equations (4.3) and (4.4) of |36j, the distributional convergence 

where "H«;(Yi) is a Cox process in M d whose distribution, conditional on the value y of 
Yx, is that of T-L K {y)i an d also, 
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where %k{y 2 ) is an independent copy of the Cox process / H k (y 1 )- 

Put /i := E£(0, 'H K (y 1 )). Under the assumed moment condition it follows that 

lim E^, p (yiJ n )( nAp (y 2 J n ) = E£(0,ft K(Yl) X(0,K (K2 )) = /i 2 , (6.10) 

n->oo 

where the last equality follows by independence. Recalling the definition ( 13. 3ft of i?£ fcp (-)> 
we have that 

n^Etf^Qg 2 = n- 1 Een,fc,p(n,y«) 2 + (1 - r^E^^Yx, ;y n )£ n (Y 2 , X), 

so we obtain from (I6.10j) that n~ 2 E fc „CV n ) 2 /i 2 as n oo. Since n _1 E H^ k (y n ) -)■ 
as n — 7- oo it follows that n~ l H^ k p (y n ) converges in L 2 to /i. 

Since /i equals the right hand side of ( 13.71) . we have ( 13 ,7p with L 2 convergence when 
q = 2. To obtain L 1 convergence when q = 1, we use a truncation argument and follow 
the proof of Proposition 3.2 in [38]. We leave the details to the reader. 

It remains to prove (13.71) holds with a.s. convergence when k G ¥ c (Ai), and (13. 5p 
holds for some p > 5. Lemmas ISTTl and IQ show that E \H* k (y n ) - H* k (3^ n -i)| 5 is 
bounded by a constant that is independent of n. Then (2.11) of [36] holds with f3 = 4/3 
and p' — 5 (and / = 1 in the notation of [36J.) By following the proof of Theorem 2.2 
of [36] we obtain for all e > that 

oo 

5>[|#; fe)P W -E#; fc)P CK)| > ^] < oo. (6.11) 

n=l 

Also, by (13. 4p (i.e., by taking A = in (13.51) ) and (16. 2p and Markov's inequality, 

oo oo 

n=l n=l 

oo 

< ^n^^EO^^i,^)! 5 ] < oo. (6.12) 

n=l 

By (IBTTT]) . (1632]) and the Borel-Cantelli Lemma, 

limn- 1 ( J ffi fc>p (^)-Ei7; fciP (^)) = 0, o.a. (6.13) 

n— >oo ' ' r 

Also, {^ n ,fc,p(^i) 34)? w > 1} are uniformly integrable by (13. 4p . so 

n-^Hl^) -MH* n>kjP (y n )\ < E|a, fc , P (Fi,^.) -C, k>P ( Y iM\ 

= E [\Zn,kA Y liyn)\H\Zn,k,p( Y Uyn)\ > n 5/l2 }\ ^ as 71 ¥ OO. (6.14) 
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Finally, by (13 .7p we have that 

E [^4(W] / E[e(0,^ (?/) )]K(y)^, 
and by (16 . 1 3[) and (16. 14j) we have (13.71) with a.s. convergence. □ 

The following lemma will be used in the proof of Theorem 13.21 It can be proved by 
following verbatim the proof of Lemma 4.2 of [33], so we omit details here. Again we 
write y y for y U {y}. 

Lemma 6.3 Suppose that Ai G M(m, d) and k G F^Ai), and £ is exponentially stabi- 
lizing. Let k G Z + , p G (0, oo] ; and suppose for some p > 2 that £ satisfies i\3.6\) . Then 
there is a constant C > such that for all X > 1 and a// y, z £ fC, 

\E^ Ap (y,V z x )^ k , p (z,V y x ) -E^ Ap (y,V x )E^ Ap (z,V x )\ < C ex V (-C- l X l l m \\z - y\\). 

Proof of Theorem \3.3[ We show first the asymptotic variance convergence (13.171) . 
Recalling from ( \2A\\ that V\ is the Poisson point process on Ai having intensity measure 
Xn(y)dy, we have (cf. the proof of Lemma 4.1 of [35J) 

A^Var^J = / E[^ Ap (y,V x ) 2 Uy)dy (6.15) 

J M 

+ [ [ {E^ p (z,Vl)Uy,n)-^^A P (^W^A P (y^x)}XK(z)K(y)dzdy. 

JM JM 

Since we assume k G F b (Ai), we can choose an index set IcN, and a set of quadruples 
(yi,5i,Ui, gi)i £ z as in Section l2~Tj such that the support k of /C is contained in finitely 
many BsXVi)^ G X := {1, ...,m} C X. Also, we define our partition of unity here by 
ipi = l{B s .(yi) \ Uj. ,/)'., (//,)}. Let 5 := mm ieXo 5 { . 

Suppose y, z G fC with ||z — y\\ > 5. Then by Lemma [6731 the integrand inside the 
braces in the double integral in (16.151) is bounded by CX exp(— C~ 1 (5X) 1 / m ), where the 
constant C does not depend on A, y or z. Hence the contribution to the double integral 
from such y, z tends to zero. 

To estimate the remaining contribution to the double integral, given y take i = i(y) 
such that ipi(y) = 1 (so in partiular B§(y) C gi(Ui)), and let U[ := g^^Bs^y)) and u : = 
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g i 1 (y). Then the contribution to inner integral in the double integral from z G B$(y) is 
given by 



= I F x {v,y)k l {u + \- 1 ' m v)dv 1 

J \ 1 / m (-u+UD 

where we set 

-E^ k , p {y,V x )EU9i{u + >r 1/m v),Vx). 

By the Poisson analog of Lemma l5~4"t together with the moment condition (I3.6p . provided 
y G gi(Ui) is a Lebesgue point of «j, for almost all v G M m we have as A — > oo that 
F\(v,y) — > F(g' i (u)(v),y), where for x G T yo Ai we set 

F(x,y) := EaO,n: (y) )ax,K (y) ) - (EaO,n K(y) )) 2 . 

Also, by (15. 3 p and the assumed a.e. continuity of k, almost every y G gi{Ui) is a 
Lebesgue point of k^. Moreover, by Lemmas 16 . H 16. 31 and 14. 1 [ we have for some constant 
C, independent of and (A,?/), that \F\(v,y)\ < Cexp(— C 1 /™!^!!). Hence, using the 
representation (12 .2\\ with our chosen partition of unity, by dominated convergence the 
double integral in (I6.15P converges as A — > oo to 

£ / / *«(„)(,),,)*(„)*,. (6,6, 



We can simplify this limit by the change of variable w = g[(u)(v). Then dw = D g (u)dv 
and by ( 15. 3ft and then ( 12 .2D the expression (16. 16ft equals 

V / K{yfijji{y)dy \ F(w,y)dw= / / F(w,y)dwK(y) 2 dy, (6.17) 

ie i ^(Ui) JT yo M J M JT y M 

so that the double integral at (16. 15ft tends to this. 

On the other hand, by the Poisson analog of Lemma [5.41 the moment bound (13. 6p . 
and dominated convergence, the single integral at ( 16 . 1 5D tends to 

lim / E[Uy,V x ) 2 Hy)dy= [ E [£(0, H K ( y )) 2 ]K(y)dy. (6.18) 
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Combining the right hand sides of (I6.17P and (16.181) . recalling the definition of at 
(13. 8p . we get from (I6.15P that A _1 Var[_f/'| k ] tends to j M V^(y, K,(y))n(y)dy, which is 
f 13 . 1 TP as desired. 

Now assume (13. 6p holds for some p > 3. To prove the normal approximation result 
(I3.19p . we adapt the proof of Corollary 2.4 of [39] (Corollary 2.1 in the arXiv version), 
putting / = 1. Recall that the partition of unity has been chosen in such a way that 
4>i{y) G {0, 1} for all y G M. and all % G X. For % G 1 let 

Id :={yelC: Uv) = 1}- 

As in Sections 4.2 and 4.3 of [39], let p\ := a log A for A > 0, where a is a suitably chosen 
large constant (see right after (4.9) of [39]). For each i G X , cover the bounded set 
g~[ 1 {K,i) by a (minimal) collection of cubes of side \~ l l m p\, denoted Q it e, 1 < i < V^(A), 
where V t = 0{Xp^ m ). 

Fix A for now. For i G Iq and 1 < i < 14(A), let be the number of points of V\ 
in /Q fl gi(Qi t i), a Poisson variable with parameter given by 

z/ ij£ := A / K.(y)dy = A / R e (x)dx. 

J JCtr\gt(Q ilt ) J gJ 1 {K, e )C\Q i x 

Note that the densities «^ are uniformly bounded because D gt (-) is uniformly bounded 
on g^iJCe) by compactness. 

Let Xj^j denote the jth point of V\ fl /Q fl ge(Qi,e), when these points are listed in 
a randomised order (cf. Section 4.2 of [39]). Then with obvious modifications, Lemmas 
4.2 and 4.3 of [39] still hold in the present setting. 

Now follow Section 4.3 of [39], but now defining the graph G\ := (V\,£\) as follows. 
The set Va consists of pairs (i,£), 1 < i < Ve(\), £ G X , and the adjacency E\ is given by 
{(i, £), (j, £')} G £\ if and only if the distance between ge(Qi,e) H/C and gi'(Qj,e>) H/C is at 
most 2aA _1//m pA- With S!^ defined similarly to S"i in [39J, the variables (5^-^, (i,£) G Va) 
have Ga dependency graph. 

Next we show that the degrees of the graphs G\ to be bounded by a constant, 
uniformly in A. By Lemma 14.11 there exists a constant K such that for all (large 
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enough) A, 

sup sup \\9i 1 (y)-97 1 (z)\\ ^ K (619) 
iax y,zaB Se { yi ) \\y-z\\ 

and also 

diam(^(Qi^)) 

sup sup — j± < K. (6.20) 

eex i<i<Vi{\) A 

If {{i,£), (j,£)} £ £, then dist(Q it e, Qj t i) < 2aK\~ 1 / m px, and for any i the number of 
such j is bounded by a constant. 

Now suppose £ ^ £', and fix % < V^(A). Set £?' := B$ ,(y' e ). Suppose dist(B' , gi(Q i e )n 
K) > (3a + K)A- 1 / m p A - Then for each j < V t >(\), since ge>{Qj,i>) H B' ^ 0, using fl6T20|) 
we have dist((j^(Q^), ge'(Qj/>) > 3aA -1 / m pA, so there are no (j, £') adjacent to (i,£). 

Suppose instead that dist(S', ge(Qi/) fl/C) < (3a + K)\~ l l m p\. Choose w G B' such 
that dist(tu, ge(Qi,e) D £) < (3a + K)\-V m p x . Suppose j is such that {(i, £), (j, f )} e E 
Then by the triangle inequality and (I6.20p . 

djs%{g t >{Qj,e),w) < dist(gi'(Qj/>),gt{Qi/)) + diam(#|(Q^)) + (3a + K)\~ 1/m p x 

< (5a + 2K)\~ 1 / m p x 

so by fl6.19p . dist(Qj^/, gj (w)) < K(5a + 2K)\~ 1 l m p\. Hence the number of such j is 
bounded by a constant. 

Thus the graphs G\ have degrees bounded uniformly by a constant independent of 
A, and we can follow the argument in [39] to complete the proof of (13.1 9ft . 

Finally, if the condition that (13. 6 p holds for some p > 3 is weakened to (13. 6 p holding 
for some p > 2, then we may similarly adapt the proof of Theorem 2.3 of [39] and show 
that when cr 2 (£, k) > 0, the left hand side of ( 13.19P goes to zero, although at a slower 
rate. That is, in this case (I3.18P holds. □ 

Proof of Theorem \ 3.2[ We now prove the variance asymptotics (13.121) . Recall the 
definition of V^(y,a) at (I3.8P and note that the definition (I3.10P gives 

ct 2 (£, «):=/" Vt(y,K(y))K(y)dy-( I 5^y^{y))n{y)dy] . (6.21) 



M \JM 
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The idea here is to follow the de-Poissonization argument in Section 5 of [35] (with 
/ = 1). To ease notation we write £ A for £ XAp and also A*£ A (j/, y) for £ x (y, y x )-£,\{y, y). 
First we seek an analog of Lemma 5.1 of [35] . Set 

71:=/ M{Q,U <y) )K{y)d V] 72==/ K(y) 2 dy [ dvE[A v ^(0,n' yMy) )). 

JM J M JT y M 

We can show by a similar argument to that used already in the proof of Theorem 13.11 
that the analog of equation (5.6) of [35] holds, namely if i ~ A and rh ~ A with £ < rh 
then 

E&QS+i, ^+1)6(^+1,^+1) -> 7i 2 - 

Taking the same partition of unity {^} as in the proof of (13.171) above, analogously to 
(5.7) of [35J we have 

^[^(w^^A^a^,^)] = iJ2 S K ^ d y S M*)<x)<te 

x / K(z)dzE{£ x (y,y^ 2 U{x,z})A z Ux,yi-i)}- (6.22) 

JM 

Using the binomial exponential stabilization and moment conditions, the contribution 
to (16.22p from z ^ gi{Ui) can be shown to vanish as A — > 00. By ( 15. 3D . the remaining 
contribution to f!6.22j) can be written, using the change of variable u = g~ (x) and 

v = ^(z), as 

[ «v)*y I *(•(«))%(«)*. / %(«)* 
xE£ A (y,;V™_ 2 u {^(«),ft(«)})A^6(ffi(w),%-i)- 

By the change of variables w = A 1//m (t> — u), this equals 

-V] / n{y)dy I ^{g^u^R^ujdu / + A~ 1/m w)c/w 



xE [e A (2/, y m _ 2 U + A- 1 ^)}) A^^-^eA^H, tt-i)]- (6-23) 

and by the analog of Lemma 3.7 of [35] (see also (I5.19P and (I5.20p of the present paper), 
along with the moments conditions and the binomial exponential stabilization to provide 
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a dominating function, as A — > oo with i ~ A and m ~ A and I < m, the expression 
(I6.23p . and hence the expression (I6.22p . tends to the expression 

ieX J M JUi JR m 

= 7i72, (6.24) 

which is analogous to (5.13) of [35J. Similarly, iE£ x (Y £+1 , ^)A y ™+ 1 «^ A (F 1 , 34) con- 
verges to 7172, analogously to equation (5.16) in [35] Moreover, as in (5.17) of [35J, 
EA^^njijA^+^ys,^) is here equal to 

ielojelo JUi JlJ i 

ki(x + \~ l/d u)\du / kj(w + \~ l/d v)\dv 

x-y^-x+Ui) Jx-y d (-w+Ui) 

xE A^^^x), y&^W^-^ZxMw), 3^ U fo(<r + A^ n)} ) + (i) 

and by the analog of Lemma 3.7 of [35] (see also (15. 19ft of the present paper), along 
with the moments conditions and the binomial exponential stabilization to provide a 
dominating function, as A — > oo with £ ~ A and rh ~ A this tends to the expression 



EA-JWWeCO,^,)))]*! x / EA^WWeiO,^^^)^ 
£ / MvWyfdy [ EA*ttO,H' yAy) )dz 

/ K{yfdy f EA z aO,U' yMy) )dz) = 7 2 2 - 

^ JTyM J 

Then by arguments similar to those in the proof of Lemma 5.1 of [35] we have a similar 
result here with the squared integral in equation (5.2) of [35] replaced here by the 
expression 

(7i+7 2 ) 2 = (/ 5^y,K(y))dyf 

J M 
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we obtain lim n ^oo n _1 Var[i/"|] = a 2 (£, k). 

Given this, by using Theorem I3.3[ following verbatim the proof of Theorem 2.3 of 
[35] . using the case p = 2 of (I6.5P (in place of Lemma 5.2 of [35]), we can obtain the 
desired (EUll and (jQ2)l . □ 

Remark. The method of proof given in this section shows that our general results can 
be extended to a broader class of £, potentially providing the limit theory for some of the 
statistics mentioned in the last paragraph of Section 1. This goes as follows. Consider 
the class T c of functionals £ which are continuous in the sense of Definition 15 . 1 1 and which 
stabilize over homogeneous Poisson point processes in the sense that (16.11) holds when 
y is a homogeneous Poisson point process and R\(y, y) < oo a.s. Then the above proof 
of Theorem 13. II shows that the conclusion of Theorem 13.11 holds when S(fe, r) is replaced 
by T c . Likewise, if T c {k) C T c consists of £ 6 J c which are exponentially stabilizing 
and binomially exponentially stabilizing for k, then Theorem 13.21 and Theorem 13.31 hold 
when S(fc,r) is replaced by FJ^k). 



7 Proofs of Theorems 2.1 - 2.5 



7.1 Proof of Theorem 1231 

We require some additional lemmas. Observe that the functional Ck(y,y), defined at 
(12. 8p . is scale invariant, namely satisfies (k(y,y) = Cfc( a 2/> c O /7 ) for all a > (cf. Re- 
mark(ii) in Section 3). Recall that for all a 6 (0, oo), T-L a is a homogeneous Poisson 
point process of intensity a in R m . 

Lemma 7.1 Let k > 3. For all a > we have E£fc(0,'H a ) = m and Var[£fc(0, %a)\ = 
m 2 /{k-3). 

Proof. If Nj := Nj(0,7i a ), then conditional on iV^, the random variables (Nj/Nk) m , 1 < 
j < k—1, are distributed as the order statistics of a sample of size k — 1 from the uniform 
distribution on [0, 1] and therefore —m\og(Nj/N k ), 1 < j < k — 1, are distributed as the 
order statistics of a sample of size k — 1 from a standard exponential distribution. Thus 
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the sum U := m Y^j=i l°E(N k /Nj) has a Gamma(/c — 1, 1) distribution and ( k (0, Ti a ) = 
(k-2)mU- 1 . Since Ep' 1 } = (k-2)- 1 and since E [U~ 2 ] = ((k - 2)(k - 3)) -1 , we have 
E( k (0,n a ) = m and EC|(0,H a ) = m 2 (k - 2)/(k - 3), which gives the result. □ 



For all k G Z+, A > 0, p > we put Ca,U^ := (k(X 1/m y, X 1/m y)l{N k (y, y) < p}; 
by scale invariance (\ )k>p (y,y) — (\,k,p(y,y)- The next lemma shows that C\,k, P satisfies 
the moment bounds in the hypotheses of Theorems I3.11I3T21 Recall that for i G Z + , Si 
denotes all subsets of JC(k) of cardinality at most i. 

Lemma 7.2 Let M. G M and k G F c (Ai). With p\ as in Lemma \J7^ we have for 
p G (0, pi), p > 1, k G Z + and i G Z + i/iai 

(7.1) 



sup E [C fc , p (2/, U A) p ] < oo if fc> p + 1 + i; 
sup E [C k , p (y, V x U A) p ] < oo if A: > p + 1 + i. 

j/e/C,A>l,^G5i 



(7.2) 



Proof. We first show ( TfTTT) . Fix p > 1. Let i € (2 P , oo) be such that exp(t 1/p ) < 
1 + 2t~ 1 / p for all t G (t ,oo). Let x G /C, n G N, .A G For j G {1, k - 1}, let 



N s := NjfaynUA). Then (^(y, y n U X) = (k - 2)(J2 k r 1 l log(N k /N j ))' 1 l{N k < p} 



and 



(k-2)- p E^ p (y,y n UA) p = E 



k-l 



j;iog|m<p} 



<to+ / P 

'to 



fc-1 



AT, 



TV, 



^io g ^ri{iv fc <p}>t 

L i=i J 



dt. 



Conditioning on A^ we obtain 



P 



k-l 



N, 



£log^ <t-*\N h 



u 



dP^ k {u)dt 



np 
P 



log^ <t- 1 '*\N h = u 



dP Nk (u)dt, (7.3) 



and by choice of to, for all t G (to, oo) we have 

P [log{N k /N!) < r^lN, = u]<P [(N k /Nx) < 1 + 2r l ' p \N k = u] 

= P [Ni > N k /(1 + 2t- l ' p )\N k = u] 
<P[N l >N k {l-2t- l / p )\N k = u}. 



(7.4) 
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Given N k = u, there are k — 1 — card(*4. fl B u ) points of y n in the interior of B u (y), and 
to have N\ > u(l — 2t~ 1 ^ p ) it is nessecary for these points to all lie in A tjU (y), where we 
set A t , u (y) := B u (y) \ B u{1 _ 2t -i /p) (y). For u G (0,p] and p < p 1 , Lemma S3] gives 

/ dy < CxivT - (u(l - 2t- l ' p )) m ) < 2 m C 1 mr 1/p w m , 

JA t , u (y)nM 



PIN, > N k (l - 2^)1^ = u\ < ( fp*M™ K ^ dz \ < C ( r i/ P )*-i-<. (7.5) 



and f B Kr yi ) dy > C 1 M m . Thus by the boundedness assumptions on k, for all w e (0, p], 

fc-l-i 

Combining this with f lT.3 j) and (I7.4p yields 

poo 

(k-2)-PE( kjP (y,y n UA) p <t + / Cit-^f-^dt, 

J t 

which is finite and independent of n if k > p + 1 + i. This gives us (17. ip . and the proof 
of ( 17.21) is just the same. □ 

Proof of Theorem \2.1[ First we prove (12.111) . Given p > 0, we have 

P[m k>p (y n ) m k (y n )} < P[U,< n {iV fe (K t , y n ) > p}\ < nP[N k (Y n , y n ) > p\. (7.6) 

Now N k (Y n , y n ) > p if and only if there are at most k — 1 points of y n -i in B p (Y n ). By 
Lemma [4.31 there exists p > such that P\Y\ G B p (y)] > p for all y e K. Hence by a 
Chernoff-type large deviations estimate for the binomial distribution (see e.g. Lemma 
1.1 of [31]), provided (n — l)p > 2{k — 1) we have for some constant C independent of 
n that 

nP[N k (Y n ,y n )>p}<nexp(-C-\n-l)p) 

which is summable in n, so that assertion in (12. lip follows by (17. 6p and the Borel-Cantelli 
lemma. 

We now prove the remaining assertions of Theorem 12.11 with pi as in Lemma 14.31 It 
suffices to show that ( k ,p, k G N, as defined at ( 12. 9|) . satisfy the conditions of Theorems 
I3.1H3.2I Since ( k is a continuous function of the k nearest neighbor distances, it belongs 
to the class E(k,r). Lemma IT21 establishes that ( ktP satisfies the moment condition 
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( 13. 4p when k > p + 1. It follows by Theorem 13. II with q = 2 there, and Lemma [7. II that 
lim^oo m kjP (y n ) = m in L 2 . Combined with (12.111) . this implies the stated convergence 
in probability (12. 12ft for rhk{y n )- 

If also k > 10, then by taking i = 3 in ( 17. ip . £fc )P satisfies (13. 5ft for p = 5.5. Hence by 
Theorem 13.11 we have lim^oo rh kjP (y n ) = m almost surely. Combined with (12. lip this 
gives (I2.12p with a.s. convergence. 

To obtain the limits (I2.13P and (I2.14p . observe that for k > 7, the moment bound 
(17. ip (with i = 3) is satisfied for p = 2.5, so taking £ = ( k we have conditions (13. 5 p 
and (13. 6 p with p = 2.5. Therefore we can apply Theorem 13.21 for this £, and since it is 
scale invariant, by ( I3.14p and Lemma 17.11 the limiting variance o" 2 (0c,k) appearing in 
Theorem O equals the right hand side of f l2~7L3l) . so fl2TT3l) and (12141) follow. Finally, 
Lemma 17.31 completes the proof of Theorem 12.11 □ 

Lemma 7.3 With a\ given by Ii2.13\) . it is the case that a\ > 0. 

Proof. Since they are given by ( 12. 5p and ( 12. 6p . the expressions V^* and 5^ k in ( 12. 13[) 
depend only on m and not on d or k. Hence by using the part of Theorem 12.11 that we 
have already proved, in the special case where d = m and k is a uniform distribution on 
the unit cube in M. m , we have a\ = lim n _ >QO nVaT[rhk tP (X n )], where X n is a point process 
consisting of n independent uniformly random vectors in a unit cube in R m . Strictly 
speaking, the unit cube is not a submanifold-with-boundary in M m , but the geometric 
estimates we have used hold for the unit cube, so the proof of Theorem 13.21 carries 
through to this case. By the definition (12.71) and scale invariance of we have 



xex n 

This can be done, either by the method of Penrose and Yukich (Section 5 of [37J), or 
by the method of Avram and Bertsimas (Proposition 5 of [2] , which can be adapted to 




xex, 



and so it is enough to show that 



lim inf n Var 




46 



binomial input). The particular functional under consideration here is not considered in 
those references, but the general approaches are well known so we omit further details. 
□ 



7.2 Proof of Theorems [231 - 1231 



Recall that Y{ are i.i.d. with density k and that Ni(y,y) is the Euclidean distance 
between y and its nearest neighbor in y, or +oo if y \ y is empty. To help deal with 
the possibility that a Poisson process V\ has no elements, define 



The proofs of Theorems 12.21 - 12.41 depend in part on the following lemmas. 

Lemma 7.4 Suppose k G ¥ c (Ai). There is a constant C such that for all n > 3, A > 1, 
n > 4 and £ G [n/2, 3n/2], y G K, A G S 3 and t G (0, oo) we have 



Proof. These bounds can be deduced from the proof of Lemma 16.11 but we prefer to 
argue directly, as follows. Letting a(t,y,n) be the nn measure of B _ 1/m (y), we have 

P[N 1 {n 1 / m y,n 1 ' m {y^A)) > t] = (1 - a(t,y,n)) e < exp(-£a(t,y,n)/n). (7.9) 

By Lemma 14.31 there is a constant C such that uniformly in n > 3, y G /C, and 
t G (0, A), we have a(t,y,n) > C~ l i m ', which gives (17. 7p for t < A, and clearly (17. 7p 
holds for t > A. The second bound (17. 8p is proved similarly. □ 



Lemma 7.5 If k is bounded and 5 G (0,m) ; then sup n E [iVi(n 1//m Y"i, n l l m y n ) 5 ] < oo. 
Proof. Set F ntV (t) := P[N x {n 1/rn y, n x l m y n „i) < t]. Then 




P[N l (n 1 / m y,n 1 / m (y i UA))>t}<exp(-C- 1 t m ); 
P[Ni(\ 1/n y,\ 1/m P\) >t]< exp(-C-H m ). 



(7.7) 
(7.8) 
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As in (17. 9p . we have for n large and all y G JC, t G (0, 1) that 

PliVx^ 1 /^, n 1 /" 1 ^-!) > t] = (1 - y, n))"" 1 > exp(-2(n - l)a(t, n)) 

where a(t,y,n) < Ct m /n by Lemma [4. 3[ Thus for t G (0, 1) we obtain that 

F nty (t) = 1 - P[A^ 1 (n 1 / m ?/,n 1 / m y„-i) > t] < 1 - exp(-3Ct m ) < 3Ct m . (7.10) 

Hence by Fubini's theorem we have for all y G /C that 

/>oo /*oo ^1 roc 

/ r 5 dF n!y (t) = 5 r^F^y^dt <5C f^^dt + 5 t- s ~ l dt, 
Jo Jo Jo Jl 

which is finite if 5 G (0,m). Integrating over y G /C gives the result. □ 

Proof of Theorem 12. 2L Let g = lorg = 2. Ifa>0 and k G F c (JA), then Lemma 
El shows that sup n E [N 1 (n 1 / m Y 1 , n 1 / m y n ) ap ] < oo for all p > 0. If a G (-m/q,0) and 
k is bounded, and if then p > q is chosen so that — m < < then Lemma 17.51 gives 
E[N 1 ( y n 1 / m Y 1 ,n 1 ^ m y n ) ap } < oo. 

Thus, in all these cases the moment condition ( 13. 4ft holds for £ = iV{* and some 
p > q. Since iVf belongs to the class H(fc, r), the limit (12.171) (with L q convergence) thus 
follows from Theorem 13.11 the fact that is homogeneous of order a (see (13.131) ). and 
the identity E [iV"(0, %)] = mu m J °° ti a + m_1 exp(— u m u m )du which yields 



E[N?(0,H)] = 




(7.11) 



see also (15) of Wade jl?]. Moreover, in the first case (a > and k G P c (.M)), Lemma 
17.41 shows that the moment condition ( 13. 5 j) holds for p > 6 and so by Theorem 13.11 we 
obtain the a.s. convergence at (I2.17p . This concludes the proof of Theorem 12.21 □ 

Proof of Theorem 12.31 First assume a > 0. To prove variance asymptotics and 
( I2.19p it suffices to show that the functional iVf satisfies the conditions of Theorem 
13.21 Since we assume k G ¥ c (Ai), Lemma [7.41 is applicable, showing that the moment 
conditions (13. 5p and ( 13. 6 p hold when £ = N®. The conditions of Theorem 13.21 are all 
satisfied, so that result gives variance asymptotics (I3.1ip and the central limit theorem 
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f)3.12p for £ = N" . Also, this choice of £ is homogeneous of order a, so that (I3.14p is 
applicable with (3 = a, and applying this identity to ( 13.1 ip and (" 13 . 12[) gives the results 
(T2TT8D and (127L91) for a > 0, subject to showing that a 2 > in (12TT81) . 

Now assume a G (—m/2,0). We cannot directly apply Theorem 13.21 because the 
moment bound (I3.6P fails since if A = {z}, the distance between y and z can be made 
arbitrarily small and thus N" x (y,V\ U {z}) can be made arbitrarily large. Instead we 
use a truncation argument and follow the approach of [3]. Put 4>(x) = x a for x > with 
0(0) = 0. Given e > define the functions 

^\x):=h ix) {{X - £ 
otherwise 

and (£) (x) := 0(x) - 0( £ )(x). Let N?°(y,y) := 0^(^i(l/^)). 

Let £ > 0. Then the moment bounds ( 13.61) and ( 13. 5p (with p = oo) hold for £ = A^f ' e 
for p = 3 (say), because 4>^(y, y) < e a for all y, y. Thus we may apply Theorem 13.21 
to deduce that as n — > oo, n~ l Vax[H n 1 (34)] converges to a 2 (N^' 6 , k) and 

n- 1/2 (H*?'\y n) -EH^\y n )) A AT(0, ^(iV^, k)). (7.12) 

To complete the proof, we adapt arguments in [1], given in detail in J3], as follows. To 
make the link with [3], for 1 < i < m we set A^ >m := A^i(Fj, ^m), giving the identification 

^n(^^n)=0(niV-). 

The equivalent of Lemma 5.1 of [3] is given by Lemma 17.61 below. Let us assume 
this for now. Lemma 5.2 of [3] remains valid here modulo some small notational mod- 
ifications; the % featuring in that result should be considered now as a homogeneous 
Poisson process in M m . Likewise, Lemma 5.3 of 0] carries over with straightforward 
modifications. By the proof of Lemma 5.5 of [3], we can show that 

lima 2 (A>f' e , re) = a 2 (N?, re). (7.13) 

With (17.131) established, the variance asymptotics (12.181) and central limit theorem (12.191) 
follow follows along the lines of the proof of Theorem 2.1 of [3J. 

Now assume either a > or a G (—m/2,0). To show positivity of the limiting 
variance in ( 12.181) we would like to follow the approach used to show positivity of a\ 
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in Lemma 17.31 In this case we do not have scale invariance. However, since k is a 
probability density function, for any £ G E.(k,r) we have from f 1 3 . 1 j) and Jensen's 
inequality that 

v 2 (Z,k)> [ {V^K(y))-5^K(y)) 2 }K(y)dy, (7.14) 

J M 

so it suffices to show that when we take £ = iV", the integrand in the braces in the right 
side of (17.14ft is strictly positive. By what we have already proved, 

V N ?(a) - (5 N "(a)) 2 = lim n - l VarR a (n l/m X n ) 

n— >oo 

where now X n is a point process consisting of n independent uniformly distributed 
random vectors in a cube of volume a -1 in M. m . This limit can be shown to be strictly 
positive by using the methods of [2] or [37]. 

Finally we prove (T2T20]) . By Q22SJ) and (17TTT]) we have 

V 2 / m 

where for x G W m we here set f(x):= EiVf (0, W) -EN?(0,H), so that 

/(*) = / (||a:|r-||y|r)exp(-a; m || 2 /|r)^ 

</R m VB M (0) 

and hence by Fubini's theorem 



f(x)dx= I c?2/exp(-u; m ||2/|| m ) / (||x|| a - |||/|| a )(ia; 

= ( -au 2 m m \ r r a + 2m-i exp (_ Umrm)dr 
V a + m J J 

T(2 + a/m) = -(a/m)oo m a/m T(l + a/m). 



—a/m^ 
dUrn 



a + m 

Substituting back into (17.151) gives us (12.201) . concluding the proof of Theorem 12.31 □ 

We state the equivalent of Lemma 5.1 of [3J. Recall N ijm := Ni(Yi, y m ). 

Lemma 7.6 Let <fir e )(') be as in the proof of Theorem \2.3[ Given 5 > there exists 
£o > and > such that for e G (0, Eq) andn > n we have Var J2i=i 0(e)( n -^ln) — ^- 
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Proof. We may proceed as in [3] with only minor modifications as far as (5.4) of [3], 

but more effort is required to adapt the proof of (5.19) of [3]. To do this, set Y = Y n+1 

(corresponding to X in p]), and for 1 < j < J define the open cones Wj with vertex Y, 

just as in [3] (these cones cover ~R d , not just the tangent hyperplane at Y). 

Let Ij^ n := l{y n H Wj(Y) ^ 0}. If Ij >n = 1 then set Z^ n to be the nearest neighbor 

of Y in y n n Wj(Y); otherwise set Zj >n := Y . Set and Rj >n := \\Zj >n — F||. Noting that 

|</>( £ )(-) I is nonincreasing on (0, e), we have as in (5.5) of [3| that 

j 

i=i 

Using the last inequality in ( I4.14p of the current paper, we have, similarly to (5.6) of 
[3], that there is a constant K$ such that P\R^ n > r] > (1 — K^r) 71 , and therefore there 
is a constant K-j such that for < t < 1 and large enough n we have 

P[nRf n > t] > (1 - ir 6 t/n) n > exp(-K 7 t). 

We may follow the argument given after (5.7) of [3] to obtain the analogue to (5.9) of 
[3] (with N™ n instead of the D 9 ink of j3]). We can then continue as in the proof of the 
case JC = M. d of Lemma 5.1 of [3], to conclude the proof. □ 



<2j2hn\<f>(d nR ™n)\- ( 7 - 16 ) 

i=i 



Before proving Theorem l2.4l we need some preliminary results. Recall that ip(y, y) '■— 
log{e~< u m N™{y, y)). If y \ {y} = 0, let us set ip{y, y) := 0. For all a e (0, oo) we claim 
that 

POO 

Eip(0,n a ) =7 + a / (log u)e- au du = -logo. (7.17) 

The first equality of (17.171) follows since the probability that the volume of the near- 
est neighbor ball around exceeds u is e~ au , and the second equality follows since 
J °° \og(w)e~ w dw = —7 (see e.g. [22], page 107). Also, since J °°(log w) 2 e~ w dw = 
7 2 + 7r 2 /6 we have 

POO 

E^j(0,n a ) 2 = / (7 + \ogufae~ au du 
Jo 

POO 

= 7 2 + 2(-7 - log 0)7 + a / (log ufe~ au du 

Jo 

= tt 2 /6 + (log a) 2 . (7.18) 
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Let and V^(a) be given by setting £ = if) in (I2.5P and (13.81) . respectively. 
Lemma 7.7 It is the case that a 2 (ip, k) = — vrr 2 + Var[log(Ac(Yi))]. 
Proof. Using ( 13. 8ft . (17. 17ft . and ( 17. 18ft . and setting t> = a 1//m w, we have that 
V*(a) - (log a) 2 - tt 2 /6 = / {e^O^^M^^X) - (E^(0,H a )) 2 ) dv 

Jr™ ^ J 

and since 7i a = a~ 1 l m 'H in distribution, the last displayed expression is 

= / {EiPiO^-^n^ipia-^v^-^n ) - (EiP(0,a- 1/m n)) 2 } dv. 
Jr™ 

By definition we always have if>(ty, ty) = if}(y, y) + mlogt, so the preceding display is 

{E[(if>(0,n v ) -loga)(if)(v,H°) - log a)] - (Eif>(0,H) - log a) 2 } cfc 

= V^-7r 2 /6-2(loga)<^, 



so 



V^(a) = (loga) 2 + U^ -25* log a. (7.19) 
Using (13. 9p . and (I7.17p . and setting v = a l / m u, we have that 

<^(o)+logo= / E[^(0,-Hf 1/m,; )-^(0,^a)]^ 

Jr™ 

= [ E[iP(0,a- 1/m n v ) -ip(0,a-^ m n)}dv = S^. 

Setting Iij(k) := J M (log n(y)y n(y)dy for j = 1,2, we may use (I3.10p and (I7.19P to 
deduce that 

a 2 (ip, k) = V^ + I 1>2 (k) - 25*I 1)1 (k) - (^ - ^(k)) 2 

= ^-(^) a + J 1|2 (K)-i? >1 («). (7.20) 

Moreover, by (12. 6p and (I7.17P we have 8^ = J Rd f(x)dx, where here we set 

f(x) := -mE ^og{N 1 {Q,U)/N 1 {Q,U x ))] 

POO POO 

= - P[Ni(0,H) > WxWe^dt = - exp(-uj m \\x\\ m e tm )dt 
Jo Jo 
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so that setting v = u; m ||x|| m we have 

/*OG POO POO 

-5* = I / exp(-ve tm )dtdv = / e~ tm dt = rrC 1 . 
Jo Jo Jo 

Substituting this into (I7.20p gives the result claimed. □ 

Lemma 7.8 Suppose either (i) that k G Fb(Ai), or (ii) that m = d and A4 = M. d and 
r c (/c) > 0. Then there exists 5 > such that sup^E [N^n^Yi, n 1 /"^) 5 ] < oo. 

Proof. First consider case (ii). Choose 5 small enough so that r c («) > d~d/(d — 5). 
Then by the proof of Theorem 13.3 of [30] (Theorem 2.3 in the arXiv version) (with S 
corresponding to ap in that proof), E [Ni(n 1 ^ m Yi, r2, 1,/m 34) 5 ] is bounded as asserted. 

Now assume case (i) instead. We adapt the proof of Theorem 13.3 of [30J to man- 
ifolds. Let ((yi, 6i, Ui, gi) , i G X ) be as in Section I2.1[ and take a finite X C X 
such that K C Uigx BsXUi)- Assume X = {l,...,z'o} for some io, and write Ai for 
BsM \ U ; . , ^.v !//,)• For any finite y write L 5 (^) for Z ye y Ni(y, 3>) a with L 5 (^) = 
if card(J^) < 1. Similarly to (3.4) of [3D], we have 

E[iV 1 (n 1 / m r 1 ,n 1 / m y n ) <5 ] =n tf / m - 1 E[L 5 (y„)], (7.21) 

and using the boundedness of K we have, similarly to (3.6) of [30], 

L\y n ) < L5 (yn n A) + C. (7.22) 

i=l 

Similarly to (3.7) of [30], by combining ( 17.2 ip and ( 17.22p we have 

io 

^L 5 (y„n^) + c 

,i=l 

and by Jensen's inequality this remains bounded, provided we can establish the deter- 
ministic bound, for all finite y C Ai, 

L 5 {y) < C(card(^)) 1 " 5/m . (7.23) 

By Lemma 14.11 there is a constant C such that for all finite y G Ai and x G y, 
taking y to be a nearest neighbor of x in y, and also taking g^ l (z) to be a nearest 
neighbor of g~[ (x) in 5'~ 1 (3^), we have that 

N 1 (x,y) = \\y-x\\ < \\z-x\\<C\\g-\z)-gr\x)\\=CN 1 {gr\x) 1 gr\y)). 

53 



E[JVi(n 1/m y ls n 1/m y„)*] =n s/m - 1 E 



Hence, L s (y) < CL s (g i and using Lemma 3.3 of [18] . we have (I7.23P as asserted. □ 

Proof of Theorem 12.41 To prove the L 2 convergence at (I2.2ip we shall apply 
Theorem 13.11 Note that ip belongs to E(k,r) and thus it suffices to verify under either 
of the hypotheses of Theorem I2.4[ that £ = ip satisfies the moment condition ( 13 .4[) for 
p = 3. Set Ni := iV 1 (n 1 / m Y 1 , n l / m y n ). It will suffice to show that sup n E | \ogN^\ p < oo. 
Given 5 G (0, 1), we can choose a constant C such that | log t| p < Ct~ s for t G [0, 1] and 
| logt| p < Ct s for t G [1, oo). Then 



By Lemmas 17.51 and I7.8[ the right hand side of (I7.24p is bounded provided 5 is chosen 
small enough. Hence we can apply Theorem I3.1[ yielding 



lim^^^ Ety(0,H K{y) )] K (y)dy = - / log( K (y)) K (y)dy in L 2 , 



by f TTTTH . Thus ( 12T2TT) holds. 

Now we suppose k G P c (.M), and prove the variance asymptotics ( 12 .221) and central 
limit theorem (I2.23p . We cannot directly apply Theorem 13 . 21 because the moment bound 
(13. 6 p fails since if A = {z}, the distance between y and z can be made arbitrarily small. 
As in the proof of the case a < of Theorem 12 .31 we use a truncation argument, defining 
for all e > the function 



Let tjj 6 be defined as t/>, with log replaced by log . 

As in the proof of Theorem 12.31 we may show that the moment bounds (I3.6P and 
( 13. 5 p hold (with p = oo) for tp £ for some p > 2. The case A = of (13 .6p follows from 
the bound 



E | log iV7| p < CE [N{ s + N(] 



(7.24) 






sup E\^(X 1/d y,X 1/d V x )\ p < oo, 



(7.25) 



a>i, ye/c 
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which is proved similarly to (13. 4p which we have already established. For the case 
A = {z}, observe that 

r(\ 1/m y,\ 1/m (VxU{z})) = r(X 1 / m y,X 1/m V x )l{N 1 (y,V x ) < \\y - z\\} 

+ \og^{^oj m X\\y-z\\ m )l{N 1 {y,V x ) > \\y-z\\}. (7.26) 

The first term in the right hand side of (I7.26P has bounded pth moment, which may 
be proved similarly to the proof of ( 13.41) . Since the function |log(-)| is decreasing on 
(0,1) and increasing on (l,oo), provided e < 1 the absolute value of the last term 
in (17.261) is bounded by | log(e)| if e 7 c<; m A||?/ — z\\ m < 1, and otherwise is bounded by 
\ip e (^\ 1 / rn y i X 1 ^ m Vx)\, which has bounded pth. moment, as noted already. This gives us 
(13. 6p . and the argument for (13.51) is similar. 

By Theorem 13.21 as n — > oo, we deduce that n~ 1 Vax[H'!jf {y n )] converges to (T 2 (^ £ , 
and 

n-W{H? (y n ) -EHf (y n )) A Af(0,cr*W, K )). (7.27) 

To complete the proof, we adapt arguments in [3], given in detail in [3], as follows. 
To make the link with [3J, put <f)(x) := log(e 7 u; m x) in Section 5 of [3], and also for 
1 < i < m, set N i>m := Ni(Yi, y m ), giving the identification ip n (Yi,y n ) = 0(niV™). Set 
log( £ )(x) = log(x) — log^x) and 4> £ {x) := log^e^x). It is easily seen with this choice 
of 4> that Lemma EE] holds (follow that proof verbatim, choosing e small enough so that 
|0 e (-)| is nonincreasing on (0,e)). It now suffices to follow the proof of Theorem 12.31 for 
the case a G (— m/2,0). 

It remains only to show <J 2 (tp, k) > 0. We again follow the approach used in the 
proof of positivity in Theorem 12.31 By (12.221) . it is enough to show that the expression 
yip — (£^ 2 ? j s strictly positive. This can be shown to be nonnegative as in the proof of 
positivity in Theorem 12. 3[ and we leave the details to the reader. □ 



Proof of Theorem [2J3 Let E (0,oo). Let <p k {y,y) := (p^\y,y) be (k + l)" 1 
times the number of fc-simplices containing y in TZ^^y), i.e. (k + l) -1 times the number 
of unordered (k + l)-tuples of points in y, all pairwise within /3 of each other, and 
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including y. Then 

cf(y) = E^(^)- ( 7 - 28 ) 

yey 

We want to show that tp k satisfies the conditions of Theorems 13.11 and 13.21 Note that 
V? fc G 3(0, r) if r > /3. For any y G JC, A G S 3 and k,l G N, (pk(n^ m y,n^ m (yi U .A)) is 
bounded by (3 + C /3 (n)) fc , where we set C^(n) := £Li l{p< - *i|| < fin- 1 /" 1 }, which is 
a binomially distibuted with parameters £ and the k measure of Bg n -i/ m (y). Assuming 
K is bounded, there is a constant C such that the latter parameter is at most C£~ l , 
uniformly in y. Hence Cp{n) is stochastically bounded by a binomial random variable 
with parameters i and C£~ x and thus for any p > 1, ip k satisfies the moment condition 
(13. 5p , and hence also (13 .4[) . Therefore by using Theorem 13.11 and (17.281) , we have 

\imn- 1 Ct{n l l m y n )=\imn- 1 H^{y n )= I E^f {Q,U K{y) )K{y)dy, (7.29) 

n— >oo ' n~ >oo J JV[ 

with both L 2 and a.s. convergence. 

Define hf : (W m ) k+1 ~+ K by fcf (a*, . . . , x k+1 ) := fL^xfc+i Mll^i ~ < Ph 
i.e. the indicator of the event that Xi, . . . , x k +i are all within distanace f3 of each other. 
By the Palm theory of Poisson processes (e.g. Theorem 1.6 of [31]) we have 

E^\0,n f x ) = —^— [ hf(0,xi,...,x k )dxi...dx k 

(A/3 m ) fc f f (i) (A/3 m ) fc 
= 71 , 7T, / •••/ h\ I '(0,x 1 ,...,x k )dx 1 ...dx k = — — T Jfc,fc+i, 
(fc + i)!y Rm y Rm (fc + l)! 

where the last equality comes from (I2.24|) . Combined with (I7.29p . this gives us (I2.26p . 

A slight modification of the above argument shows that (p k satisfies the Poisson 
moment condition (13.61) for any p > 1. By Theorem 13.21 the variance asymptotics (12. 27ft 
and central limit theorem (I2.28j) follow, with a k (/3, k) := a 2 {ipf\ k) given by (13. 10p . We 
need to show that this is consistent with (I2.25p . 

The expression 5 lfik (a), given by ( 13. 9ft , simplifies further as 

8^{a) = {arfJkMi + a I E [A> fc (0, H a )]du. (7.30) 

</R m 

Using the Palm theory of the Poisson process again, we have for all u G M m that 
E[A> fc (0,^)]= °* 1 / ■■•/ hP(0,u,x 1 ,...,x k )dx 1 ...dx k „ l . 
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Together with (I7.30p . this gives 



aP m ) k Jk,k + i + 



a k k 



(Jfe + 1)! 



Xi, 



Xk)dxi ■ ■ ■ dxi, 



(k + l)(arfJ k , 



Jfe+1- 



(7.31) 



For the first term in (13.101) we simplify the expression V Vk (a) as follows. Consider the 
special case where m = d and M. is a smoothly bounded region of volume a -1 , and let 
V' x be a homogeneous Poisson point process in this region with expected total number 
of points equal to A. By applying (13. 17j) in this case, recalling notation c£ from Section 
|2~H we get that 



V Vt >{a) = lim A -1 Var V ip k (X 1/m y, X l/m V' x ) = lim A _1 Var[cf \\ 1/m P' x )] 

A^oo ' J \— i>oo 

lim A-VarfCf {a/A)1/m) (a 1/m n)]- 



Hence by Proposition 3.7 of 



A— >oo 

setting k' = k + 1 we have 

k> 

3=1 



Using this identity in the first term of (I3.10p . and using (I7.3ip for the second term of 
(I3.10p . enables us to establish the identiy (I2.25p . 

It remains to show that a 2 (ipf\ k) > 0. This can be done as in the proof of Lemma 
I2.3[ i.e. using (I7.14p to reduce the problem to showing positivity in the case where 
d = m and k is a uniform distribution on a cube, and using the methods of [2] or [37] 
to demonstrate positivity in this case. This concludes the proof of Theorem 12.51 □ 
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